Title: Compositions and Methods for Regulating the Kinase Domain of Receptor Tyrosine Kinases

A portion of the disclosure of this patent document contains material that is subject to copyright
protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent
document or patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but
otherwise reserves all copyright rights whatsoever.

FIELD OF THE INVENTION 

The present invention relates to binding pockets of receptor tyrosine kinases (RTKs). The binding
pockets may regulate the kinase domain of the receptor tyrosine kinases. In particular, the invention relates
to a crystal comprising a binding pocket of a receptor tyrosine kinase that regulates the kinase domain of
the receptor tyrosine kinase. The crystal may be useful for modeling and/or synthesizing mimetics of a
binding pocket or ligands that associate with the binding pocket. Such mimetics or ligands may be capable
of acting as modulators of receptor tyrosine kinase activity, and they may be useful for treating,
inhibiting, or preventing diseases modulated by such receptors.

Methods are also provided for regulating the kinase domain of an RTK by changing a binding
pocket of the RTK that regulates the kinase domain from an autoinhibited state to an active state or from
an active state to an autoinhibited state.

BACKGROUND 

Cell surface receptors with protein-tyrosine kinase activity mediate the biological effects of many 
extracellular signaling proteins, and thereby regulate aspects of normal cellular behavior such as growth
and differentiation, movement, metabolism and survival (van der Geer and Hunter, 1994). The profound
consequences of phosphotyrosine signaling on cellular function are emphasized by the effects of mutations 
that deregulate receptor tyrosine kinase activity, which are frequently associated with malignant 
transformation or developmental abnormalities. Under normal circumstances, the activation of receptor 
tyrosine kinases (RTKs) requires binding of the appropriate extracellular ligand, which induces either
receptor oligomerization or a spatial re-organization of pre-associated receptor chains (Heldin, 1995; Remy
et al., 1999; Schlessinger, 2000). As a result, the receptor undergoes autophosphorylation through an
intermolecular reaction, both on tyrosine residues which regulate kinase activity, and on residues within 
non-catalytic regions of the receptor which form binding sites for cytoplasmic targets with SH2 or PTB 
domains (Pawson and Scott, 1997; Kuriyan and Cowburn, 1997). 

The catalytic activity of tyrosine kinases is frequently stimulated by autophosphorylation within a
region of the kinase domain termed the activation segment (Weinmaster et al., 1984), and indeed this has
been viewed as the principal mechanism through which RTKs are activated (Hubbard and Till, 2000; 
Hubbard, 1997). Structural analysis of the isolated kinase domains of several receptors has revealed how 
the activation segment represses kinase activity, and the means by which phosphorylation releases this
autoinhibition. In the case of the inactive insulin receptor, Tyr 1162 in the activation segment protrudes
into the active site, and the activation segment blocks access to the ATP-binding site (Hubbard et al.,
1994). Autophosphorylation of Tyr 1162 and two adjacent tyrosine residues repositions the activation
segment, thereby freeing the active site to engage exogenous substrates and reorganizing the residues 
required for catalysis into a functional conformation (Hubbard, 1997). In contrast, the activation segment
of the fibroblast growth factor (FGF) receptor is relatively mobile and the tyrosines which become 
phosphorylated upon receptor activation do not occupy the active site. However, the C-terminal end of the 
FGFR1 activation segment appears to block access to substrate (Mohammadi et al., 1996).

Despite the evident importance of the kinase domain activation segment, it remains possible that
other mechanisms are important in regulating RTK activity, which might have been missed through an 
exclusive focus on the kinase domain itself. In particular, recent biochemical and mutational analysis has 
suggested that Eph receptors may be regulated through a more complex mechanism, involving the 
juxtamembrane region (Bums et al., 2000; Zisch et al., 1998; Zisch et al., 2000). 

There is only a single Eph receptor tyrosine kinase encoded by the C. elegans genome (VAB-1)
(George et al., 1998; Wang et al., 1999a), but the subfamily has undergone a remarkable expansion during
metazoan evolution to include at least 14 mammalian members, which therefore represent the largest class 
of vertebrate RTKs (Holder and Klein, 1999). These Eph receptors fall into two groups, A and B, based on 
their ability to bind ligands (ephrins), which are themselves cell surface proteins anchored to the plasma
membrane either through a GPI linkage (A-type ephrins) or a transmembrane region (B-type) (Eph
Nomenclature Committee, 1997; Gale et al., 1996). Signaling between Eph receptors and ephrins generally 
involves direct cell-cell interactions (Holland et al., 1996; Bruckner et al., 1997), and frequently results in 
the repulsion of these cells one from another (Drescher et al., 1995; Wang and Anderson, 1997; Mellitzer 
et al., 1999). Eph receptors are implicated in morphogenetic cell movements (Wang et al., 1999a;
Chin-Sang et al., 1999), in defining cell boundaries in structures such as the rhombomeres of the embryonic
hindbrain (Xu et al., 1999), in controlling axon guidance and the establishment of topographic maps in the
central nervous system (Nakamoto et al., 1996; Brown et al., 2000), and in determining the trajectories of
migrating neural crest cells (Krull et al., 1997). Signaling between ephrin and Eph receptor-expressing 
cells is also essential for angiogenesis, and in conferring distinct arterial and venous identities to
developing blood vessels (Wang et al., 1999b; Adams et al., 1999; Gerety et al., 1999).

The extracellular region of Eph receptors contains an N-terminal ephrin-binding domain 
(Labrador et al., 1997) that folds into a jellyroll β-sandwich (Himanen et al., 1998), followed by a
cysteine-rich region and two fibronectin type III repeats (Pasquale, 1991; Henkemeyer et al., 1994). A
single membrane-spanning sequence is followed by a relatively lengthy juxtamembrane region, an
uninterrupted kinase domain, an α-helical sterile alpha motif (SAM) domain implicated in receptor
oligomerization (Stapleton et al., 1999; Thanos et al., 1999), and a C-terminal motif capable of binding
PDZ domain proteins (Hock et al., 1998; Torres et al., 1998). Activation of receptors such as EphB2 or
EphA4 is accompanied by autophosphorylation on multiple residues, most notably on two tyrosines within 
a highly conserved juxtamembrane motif (YIDPFTYEDP in EphB2) and on a tyrosine within the
activation segment of the kinase domain (Holland et al., 1997; Choi and Park, 1999; Ellis et al., 1996; Kalo
and Pasquale, 1999; Zisch et al., 1998; Binns et al., 2000). By analogy with other RTKs, it might be 
expected that autophosphorylation of the activation segment tyrosine would stimulate kinase activity, 
while the juxtamembrane phosphotyrosine sites would recruit cytoplasmic targets. Indeed, the 
juxtamembrane phosphotyrosine motifs do bind SH2 domain signaling proteins, including p120-RasGAP,
Nck, phosphatidylinositol 3'-kinase, SHEP-1 and Src family kinases among others, which can potentially
direct cellular responses to ephrin stimulation (Dodelet et al., 1999; Ellis et al., 1996; Holland et al., 1997;
Holland et al., 1998; Zisch et al., 1998).

Consistent with the possibility that phosphorylation of the conserved juxtamembrane tyrosines is
important for signaling, substitution of these residues in EphB2 with phenylalanine abrogates EphB2-
mediated growth cone collapse upon stimulation of NG108 neuronal cells with ephrin B1. However, this
loss of biological activity is apparently not due solely to a failure to engage SH2-containing targets, since 
substitution of the juxtamembrane tyrosines in EphB2 and EphA4 with phenylalanine leads to a severe loss
of ephrin-induced kinase activity (Binns et al., 2000).

SUMMARY OF THE INVENTION

Applicants have solved the x-ray crystal structure of an Eph receptor tyrosine kinase domain and 
juxtamembrane region in an autoinhibited state. The results show that in its unphosphorylated state, the 
juxtamembrane region adopts a helical structure that distorts the conformation of the small lobe of the
kinase domain, thereby disrupting the active site. These results indicate a novel mechanism for the
regulation of RTKs. 

Solving the crystal structure has enabled the determination of key structural features of the kinase 
domain and juxtamembrane region, particularly the shape of binding pockets, or parts thereof, that permit
the juxtamembrane region and kinase domain to associate, resulting in an autoinhibited state. The crystal
structure has also enabled the determination of key structural features in molecules or ligands that interact
or associate (e.g. nucleotides, cofactors, inhibitors, and substrates) with the binding pockets.

Knowledge of the autoinhibited conformation of binding pockets of RTKs that regulate the kinase 
domain is of significant utility in drug discovery. The association of natural ligands and substrates with the 
binding pockets of RTKs is the basis of many biological mechanisms. In addition, many drugs exert their
effects through association with the binding pockets of RTKs. The associations may occur with all or any
parts of a binding pocket. An understanding of the association of a drug with the active and autoinhibited
conformations of binding pockets of RTKs will lead to the design and optimization of drugs having more
favorable associations with their target RTKs and thus provide improved biological effects. Therefore,
information about the shape and structure of binding pockets of RTKs in their autoinhibited and activated
states is invaluable in designing potential modulators of the receptors for use in treating diseases and
conditions associated with or modulated by the receptors. 

The present invention relates to a binding pocket of a receptor tyrosine kinase (RTK). In an aspect 
of the invention, the binding pocket regulates the kinase domain of the receptor tyrosine kinase or is 
involved in maintaining an autoinhibited state or active state of an RTK.

The invention also relates to a crystal comprising a binding pocket of an RTK that regulates the
kinase domain of the RTK. The binding pocket may be in an autoinhibited state or active state. Thus, a
binding pocket may be involved in maintaining an autoinhibited state or active state of an RTK.

In an embodiment, the invention provides a crystal comprising a juxtamembrane region and/or
kinase domain of an RTK, or part thereof. The invention contemplates a crystal formed by a 
juxtamembrane region and a kinase domain of an RTK in an autoinhibited state or active state. 

The invention also contemplates a crystal comprising a binding pocket of a receptor tyrosine
kinase that regulates the kinase domain of the receptor tyrosine kinase in association with a ligand.

The present invention also contemplates molecules or molecular complexes that comprise all or 
parts of either one or more binding pockets of the invention, or homologs of these binding pockets that 
have similar structure and shape. 

The present invention also provides a crystal comprising a binding pocket of an RTK of the
invention and at least one ligand. A ligand may be complexed or associated with a binding pocket. Ligands
include a nucleotide or analogue or part thereof, a substrate or analogue thereof, a cofactor, and/or heavy 
metal atom. A ligand may be a modulator of the activity of an RTK. 

In an aspect the invention contemplates a crystal comprising a binding pocket of an RTK of the 
invention complexed with a nucleotide or analogue thereof from which it is possible to derive structural
data for the nucleotide or analogue thereof.

The shape and structure of a binding pocket may be defined by selected atomic contacts in the 
pocket. In an embodiment, the binding pocket is defined by one or more atomic interactions or enzyme 
atomic contacts as set forth in Table 2. Each of the atomic interactions is defined in Table 2 by an atomic 
contact (more preferably, a specific atom where indicated) on the juxtamembrane region and by an atomic
contact (more preferably, a specific atom where indicated) on the kinase domain, juxtamembrane region, or
ligand. 

An isolated polypeptide comprising a binding pocket with the shape and structure of a binding 
pocket described herein is also within the scope of the invention. 

The invention also provides a method for preparing a crystal of the invention, preferably a crystal
of a binding pocket of an Eph receptor, or a complex of such a binding pocket and a ligand.

Crystal structures of the invention enable a model to be produced for a binding pocket of the 
invention, or complexes or parts thereof. The models will provide structural information about the 
autoinhibited or active state of a binding pocket of an RTK or a ligand and its interactions with a binding
pocket. Models may also be produced for ligands. A model and/or the crystal structure of the present
invention may be stored on a computer-readable medium.

The present invention includes a model of a binding pocket of the present invention that 
substantially represents the structural coordinates specified in Table 3. The invention also includes a model 
that comprises modifications of the model substantially represented by the structural coordinates specified 
in Table 3. A modification may represent a binding pocket that is involved in maintaining an autoinhibited
state or active state of an RTK or regulates the kinase domain of an RTK. A model is a representation or
image that predicts the actual structure of the binding pocket. As such, a model is a tool that can be used to 
probe the relationship between a binding pocket's structure and function at the atomic level, and to design 
molecules that can modulate the binding site and accordingly RTK activity. 

Thus, the invention provides a model of: (a) a binding pocket of an RTK that is involved in
maintaining an autoinhibited state or active state of an RTK or regulates the kinase domain of an RTK; and
(b) a modification of the model of (a). 

A method is also provided for producing a model of the invention representing a binding pocket
of an RTK that is involved in maintaining an autoinhibited state or active state of an RTK or regulates the
kinase domain of an RTK, comprising representing amino acids of the binding pocket at substantially the 
structural coordinates specified in Table 3. 

A crystal and/or model of the invention may be used in a method of determining the secondary 
and/or tertiary structures of a polypeptide or binding pocket with incompletely characterised structure.
Thus, a method is provided for determining at least a portion of the secondary and/or tertiary structure of
molecules or molecular complexes which contain at least some structurally similar features to a binding 
pocket of the invention. This is achieved by using at least some of the structural coordinates set out in 
Table 3. 

A crystal of the invention may be useful for designing, modeling, identifying, evaluating, and/or
synthesizing mimetics of a binding pocket or ligands that associate with a binding pocket. Such mimetics
or ligands may be capable of acting as modulators of receptor tyrosine kinase activity, and they may be 
useful for treating, inhibiting, or preventing diseases modulated by such receptors. 

Thus, the present invention contemplates a method of identifying a modulator of an RTK 
comprising the step of applying the structural coordinates of a binding pocket, or atomic interactions, or
atomic contacts of a binding pocket, to computationally evaluate a test ligand for its ability to associate
with the binding pocket, or part thereof. Use of the structural coordinates of a binding pocket, or atomic
interactions, or atomic contacts of a binding pocket to design or identify a modulator is also provided. 

In an embodiment, the invention contemplates a method of identifying a modulator of an RTK 
comprising determining if a test agent inhibits or potentiates an autoinhibited state or active state of a
kinase domain of the RTK.

The invention further contemplates classes of modulators of RTKs based on the shape and 
structure of a ligand defined in relation to the molecule's spatial association with a binding pocket of the 
invention. Generally, a method is provided for designing potential inhibitors of RTKs comprising the step 
of applying the structural coordinates of a ligand defined in relation to its spatial association with a binding
pocket, or a part thereof, to generate a compound that is capable of associating with the binding pocket.

It will be appreciated that a modulator of an RTK may be identified by generating an actual 
secondary or three-dimensional model of a binding pocket, synthesizing a compound, and examining the 
components to find whether the required interaction occurs. 

A potential modulator of an RTK identified by a method of the present invention may be
confirmed as a modulator by synthesizing the compound, and testing its effect on the RTK in an assay for
that receptor's enzymatic activity. Such assays are known in the art (e.g. phosphorylation assays).

A modulator of the invention may be converted using customary methods into pharmaceutical
compositions. A modulator may be formulated into a pharmaceutical composition containing a modulator 
either alone or together with other active substances. 

Therefore, the methods of the invention for identifying modulators may comprise one or more of
the following additional steps:

(a) testing whether the modulator is a modulator of the activity of an RTK, preferably testing the
activity of the modulator in cellular assays and animal model assays; 

(b) modifying the modulator; 

(c) optionally rerunning steps (a) or (b); and 

(d) preparing a pharmaceutical composition comprising the modulator.

Steps (a), (b), (c) and (d) may be carried out in any order, at different points in time, and they need
not be sequential. 

Still another aspect of the present invention provides a method of conducting a drug discovery 
business comprising: 

(a) providing one or more systems employing the atomic interactions, atomic contacts, or
structural coordinates of a binding pocket of an RTK, for identifying agents by their ability
to inhibit or potentiate the atomic interactions or atomic contacts of a binding pocket;

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, for
efficacy and toxicity in animals; and

(c) formulating a pharmaceutical preparation including one or more agents identified in step (b)
as having an acceptable therapeutic profile.

A further aspect of the present invention provides a method of conducting a drug discovery
business comprising: 

(a) providing one or more systems for identifying agents by their ability to inhibit or potentiate
an autoinhibited state or active state of a kinase domain of an RTK;

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, for 
efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step (b) 
as having an acceptable therapeutic profile. 

In certain embodiments, the subject methods can also include a step of establishing a distribution
system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a
sales group for marketing the pharmaceutical preparation. 

Yet another aspect of the invention provides a method of conducting a target discovery business 
comprising: 

(a) providing one or more systems employing the atomic interactions, atomic contacts, or
structural coordinates of a binding pocket of an RTK, for identifying agents by their ability to
inhibit or potentiate the atomic interactions or atomic contacts, or providing one or more
systems for identifying agents by their ability to inhibit or potentiate an autoinhibited state or
active state of a kinase domain of an RTK;

(b) (optionally) conducting therapeutic profiling of agents identified in step (a) for efficacy and
toxicity in animals; and

(c) licensing, to a third party, the rights for further drug development and/or sales for agents
identified in step (a), or analogs thereof.

Methods are also provided for regulating the kinase domain of an RTK by changing a binding
domain or pocket of an RTK that regulates the kinase domain from an autoinhibited state to an active state
or from an active state to an autoinhibited state. A binding domain or pocket of an RTK may be changed
from an autoinhibited state by altering amino acid residues forming the binding pocket (e.g. introducing
mutations) or using a modulator. 

In an aspect the invention provides a method for inhibiting kinase activity of an RTK comprising 
maintaining the RTK or a binding pocket thereof involved in regulating the kinase domain in an 
autoinhibited state, or potentiating an autoinhibited state for the RTK or binding pocket thereof involved in
regulating the kinase domain. An autoinhibited state may be maintained or potentiated by inhibiting
phosphorylation of phosphoregulatory sites of the juxtamembrane segment and/or kinase domain (e.g.
activation segment). Inhibition may be accomplished using modulators, or altering the structure of a
binding pocket of the RTK comprising the phosphoregulatory sites, to prevent phosphorylation of the sites.

The invention contemplates a method for altering the stability of an autoinhibited state of an RTK
comprising phosphorylating phosphoregulatory sites of a juxtamembrane region of the RTK.

In an aspect the invention relates to a method for changing an RTK from an autoinhibited state to 
an active state comprising phosphorylating phosphoregulatory sites of a juxtamembrane region of the 
RTK. 

In another aspect the invention provides a method for activating kinase activity of an RTK
comprising phosphorylating phosphoregulatory sites of a juxtamembrane region and kinase domain (e.g.
activation segment) of the RTK involved in maintaining the RTK in an autoinhibited state.

The invention also contemplates a method of treating or preventing a condition or disease 
associated with an RTK in a cellular organism, comprising: 

(a) administering a modulator of the invention in an acceptable pharmaceutical preparation; and

(b) activating or inhibiting the RTK to treat or prevent the disease.

In an aspect the invention provides a method for treating or preventing a condition or disease 
involving increased RTK activity comprising maintaining the RTK or a binding pocket thereof involved in
regulating the kinase domain of the RTK in an autoinhibited state. An autoinhibited state may be
maintained as described herein. In an embodiment the condition or disease is cancer.

The invention provides for the use of a modulator identified by the methods of the invention in
the preparation of a medicament to treat or prevent a disease in a cellular organism. Use of modulators of
the invention to manufacture a medicament is also provided. 

These and other aspects of the present invention will become evident upon reference to the
following detailed description and Tables, and attached drawings.

DESCRIPTION OF THE DRAWINGS AND TABLES

The present invention will now be described only by way of example, in which reference will be
made to the following Figures:

Figure 1. Structure-based sequence alignment of the juxtamembrane segments and kinase
domains of murine and human EphB2, murine EphA4 and cAPK, and human IRK, FGFR1, Hck, Kit,
PDGFRβ, and Flt3. The secondary structure elements of murine EphB2 are indicated, with the
juxtamembrane segment, the N-terminal kinase lobe, the g-loop, and the C-terminal lobe coloured red, green,
orange, and blue, respectively. Residues Phe 620 and Tyr 750 and those marked with a star are involved in
the juxtamembrane/kinase domain interface. The two juxtamembrane tyrosines (604 and 610) that were
mutated to phenylalanine are highlighted in light blue. Additional tyrosines identified by Kalo and
Pasquale (1999) as in vivo phosphorylation sites are highlighted in purple. The solid triangle indicates the
site of a 16 amino acid insertion in chicken EphB2 resulting from alternate RNA processing (Connor and
Pasquale, 1995). For Kit and PDGFRβ, tyrosines highlighted in yellow denote autophosphorylation sites,
while sites of activating point mutations and deletions are shaded gray (Tsujimura et al., 1996; Irusta and
DiMaio, 1998; Kitayama et al., 1995; Hirota et al., 1998). The locations and regions of duplicated
sequence for activating Flt3 mutations are indicated by solid black triangles and underlining (Hayakawa et
al., 2000).

Figure 2. Overview of the autoinhibited EphB2 structure. (a) Ribbon diagram of the EphB2
crystal structure in complex with AMP-PNP. The juxtamembrane region, N-terminal kinase lobe,
C-terminal kinase lobe, and g-loop are coloured red, green, blue and orange, respectively. Phosphoregulatory
residues Tyr/Phe 604 and Tyr/Phe 610 are coloured light blue, Tyr667 is coloured purple, and the adenine
moiety of AMP-PNP is coloured red. (b) Ribbon representation of EphB2 colored as in (a), rotated 90°
about the vertical axis. (c) and (d) The juxtamembrane regions in (a) and (b), respectively, have been
magnified to detail the interactions between the juxtamembrane region and helix αC of the N-terminal
kinase lobe. Carbon, oxygen, nitrogen and sulfur atoms are shown in yellow, red, blue, and green,
respectively. Residues involved in the juxtamembrane/kinase domain interface but not shown include
Ala616, Ala621, Leu676, Leu693, and Val696. All ribbon diagrams were prepared with RIBBONS
(Carson, 1991b).

Figure 3. Comparison of autoinhibited EphB2 RTK with the active insulin receptor kinase. (a)
Superposition of EphB2 with active insulin receptor kinase (Protein Data Bank ID code 1IR3). The
backbone of the juxtamembrane region of EphB2 is shown in red, with the side chains of Tyr/Phe 604 and
Tyr/Phe 610 coloured light blue. The EphB2 kinase domain, g-loop and bound adenine moiety are colored
blue, orange and red, respectively. The backbone of active IRK is coloured dark green with its activation
segment, g-loop, and bound nucleotide shown in purple, pink, and light green respectively. The two
receptors were aligned using all elements of the C-terminal lobes except the kinase insert region, the
activation segment, helix αJ, and the C-terminal tail (rms fit = 1.91 Å). (b) Stereo view of the boxed
region in (a), with EphB2 phosphorylation sites shown in purple, other EphB2 side chain atoms coloured
as in Figure 2c and 2d and IRK side chains shown in green and pink. (c) Stereo view of the boxed region
in (a), highlighting the kinase catalytic region. This panel is colored as in (b). (d) Stereo view of the boxed
region in (a) highlighting switch region 1. Inactive IRK (Protein Data Bank ID code 1IRK), shown in
yellow, is also superimposed. All side chains are colored according to their respective backbones. IRK
residue labeled Thr776 corresponds to Ser776 in EphB2.

Figure 4. Electrostatic surface representation of EphB2. Blue and red regions indicate positive
and negative potential, respectively (10 to -10 kBT). Phosphoregulatory residues Tyr/Phe 604 and Tyr/Phe
610 are coloured light blue. The molecular surface of EphB2 is oriented as in Figure 2a and was generated
using GRASP (Nicholls et al., 1991).

Figure 5. Comparison of the kinase activities of EphA4 and EphB2 wild-type and mutant
proteins. (a) GST-EphA4 proteins were expressed in E. coli, and cell lysates were subjected to immunoblot
analysis with anti-pTyr antibody (top panel) and anti-GST antibody (lower panel). (b) Equal quantities of
GST-EphA4 proteins bound to glutathione sepharose were assessed for their ability to autophosphorylate
and phosphorylate enolase by an in vitro kinase assay (top panel). Immunoblot analysis of GST-EphA4
proteins with anti-GST antibody (lower panel). (c) Histogram of the specific activities of EphA4 wild-type
and mutant proteins as measured by the spectrophotometric coupling assay at 1 mM S-1 peptide and
0.5 µM EphA4 proteins. The velocities represent the mean of triplicate reactions and have been normalized
to the specific activity of wild-type EphA4 (top panel). Coomassie stained SDS-PAGE analysis of EphA4
proteins (lower panel). (d) EphB2 and its mutants were expressed in COS-1 cells and immunoprecipitated.
The immunoprecipitates were resolved by SDS-PAGE, immunoblotted with anti-pTyr (top panel) or
anti-EphB2 (middle panel) antibodies, and assessed for their ability to autophosphorylate and phosphorylate
enolase by an in vitro kinase assay (bottom panel).

Figure 6. Schematic diagram highlighting differences between the autoinhibited (left) and active
(right) states of the Eph receptor family of tyrosine kinases. The active configuration is based on the
crystal structure of active IRK (Protein Data Bank ID code 1IR3). Dashed lines indicate regions of
activation segment disorder. The numbering scheme corresponds to murine EphB2.

The present invention will now be described only by way of example, in which reference will be 
made to the following Tables:

Table 1 shows the data collection, structure determination and refinement statistics.

Table 2 shows intermolecular contacts in a binding pocket of the invention. 

Table 3 shows the structural coordinates of the juxtamembrane region and kinase domain of an 
EphB2 receptor. 

In Table 3, from the left, the second column identifies the atom number; the third identifies the
atom type; the fourth identifies the amino acid type; the sixth identifies the residue number; the seventh
identifies the x coordinates; the eighth identifies the y coordinates; the ninth identifies the z coordinates; 
the tenth identifies the occupancy; and the eleventh identifies the temperature factor. 
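
By way of illustration only, the column layout described above may be read into a simple record as sketched below. The sketch assumes that each row of Table 3 is whitespace-delimited; the first and fifth columns are not described above and are therefore skipped rather than interpreted. The names AtomRecord and parse_table3_row, and the sample row in the usage note, are hypothetical and do not appear in Table 3.

```python
from typing import NamedTuple


class AtomRecord(NamedTuple):
    """One row of Table 3, using the column meanings given in the text."""
    atom_number: int
    atom_type: str
    residue_type: str
    residue_number: int
    x: float
    y: float
    z: float
    occupancy: float
    temperature_factor: float


def parse_table3_row(line: str) -> AtomRecord:
    """Parse one whitespace-delimited row (assumed layout, 11 columns)."""
    fields = line.split()
    if len(fields) < 11:
        raise ValueError(f"expected at least 11 columns, got {len(fields)}")
    # Columns are counted from the left starting at 1, so column k is
    # fields[k - 1]. Columns 1 and 5 are not described and are ignored.
    return AtomRecord(
        atom_number=int(fields[1]),
        atom_type=fields[2],
        residue_type=fields[3],
        residue_number=int(fields[5]),
        x=float(fields[6]),
        y=float(fields[7]),
        z=float(fields[8]),
        occupancy=float(fields[9]),
        temperature_factor=float(fields[10]),
    )
```

For example, a made-up row such as "ATOM 1 N ALA A 616 12.345 -6.789 10.111 1.00 25.50" (values invented for illustration, not taken from Table 3) would yield a record with atom number 1, atom type N, amino acid type ALA and residue number 616.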

DETAILED DESCRIPTION OF THE INVENTION

Unless otherwise indicated, all terms used herein have the same meaning as they would to one 
skilled in the art of the present invention. Practitioners are particularly directed to Current Protocols in
Molecular Biology (Ausubel) for definitions and terms of the art.

In accordance with the present invention there may be employed conventional biochemistry,
enzymology, molecular biology, crystallography, bioinformatics, microbiology, and recombinant DNA
techniques within the skill of the art. Such techniques are explained fully in the literature. See for example,
Sambrook, Fritsch, & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; DNA Cloning: A Practical Approach,
Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid
Hybridization (B.D. Hames & S.J. Higgins eds. 1985); Transcription and Translation (B.D. Hames & S.J.
Higgins eds. 1984); Animal Cell Culture (R.I. Freshney ed. 1986); Immobilized Cells and Enzymes (IRL
Press, 1986); and B. Perbal, A Practical Guide to Molecular Cloning (1984).

For ease of reference the murine numbering scheme for EphB2 is employed herein to describe
specific amino acid residues in aspects of the invention. However, a person skilled in the art could readily
determine the corresponding amino acid residues in other RTKs, more particularly in Eph receptors.

Receptor Tyrosine Kinases (RTKs)

The invention generally relates to RTKs. RTKs mediate pathways involving multiple extracellular
and intracellular signals, integration and amplification of these signals by second messengers, and the
activation of cellular processes including cell proliferation, cell division, cell growth, the cell cycle, cell
differentiation, cell migration, axonogenesis, nerve cell interactions, and regeneration. Signaling pathways
mediated by receptor tyrosine kinases may be initiated by growth factors binding to specific RTKs on cell
surfaces. The binding of a growth factor to its receptor activates RTK signaling pathways. The RTKs have
an extracellular N-terminal domain that binds the growth factor and a cytoplasmic C-terminal domain
containing a protein tyrosine kinase that is capable of autophosphorylation and of the phosphorylation of
other protein substrates. Autophosphorylation takes place within a region of the kinase domain of the RTK
termed the "activation segment" (Weinmaster et al., 1984). The binding of a growth factor to its receptor
activates the tyrosine kinase, which phosphorylates a variety of signaling molecules, thereby initiating
signaling pathways that can lead to DNA replication, RNA and protein synthesis, and cell division.

Receptor tyrosine kinases within the scope of the present invention include but are not limited to
epidermal growth factor receptor (EGFR), PDGF receptor, insulin receptor tyrosine kinase (IRK), Met
receptor tyrosine kinase, fibroblast growth factor (FGF) receptor, insulin receptor, insulin growth factor
(IGF-1) receptor, TrkA receptor, IL-3 receptor, B cell receptor, TIE-1, Tek/Tie2, Flt-1, Flk, VEGFR3,
EGFR/Erbb, Erb2/neu, Erb3, Ret, Kit, Alk, Axl, FGFR1, FGFR2, FGFR3, Hck, cAPK, keratinocyte
growth factor (KGF) receptor, and Eph receptors.

The invention preferably contemplates Eph receptors, more preferably EphB2 receptors.

The term "Eph receptor" refers to a subfamily of closely related transmembrane receptor tyrosine
kinases related to Eph, a receptor named for its expression in an erythropoietin-producing human
hepatocellular carcinoma cell line. The receptors contain cell adhesion-like domains on their extracellular
surface. The N-terminal extracellular region of all Eph family members contains a domain necessary for
ligand binding and specificity, followed by a cysteine-rich domain and two fibronectin type III repeats. The
cytoplasmic region has a centrally located tyrosine kinase domain. C-terminal to the catalytic region is a
sterile alpha motif (SAM) domain, which forms dimers or oligomers in solution and may contribute to
regulation of receptor clustering. Localization or clustering of Eph proteins may also be influenced by PDZ
domain effectors which potentially interact with specific C-terminal receptor motifs.

N-terminal to the kinase domain is the juxtamembrane domain. Two invariant tyrosine residues
(tyrosines 596 and 602 of EphA4; tyrosines 604 and 610 of EphB2) in the juxtamembrane domain are
embedded in a characteristic and highly conserved ~10 amino acid sequence motif. These tyrosine residues
are major sites for autophosphorylation and they have been found to associate with a number of SH2
domain-containing cytoplasmic proteins such as Ras GTPase-activating protein (RasGAP), the p85 subunit
of phosphatidylinositol 3' kinase, Src family kinases, the adapter protein Nck, and SHEP-1, which binds
the R-Ras and Rap1A GTPases. Signaling mediated by such SH2 domain-containing proteins may
contribute to the physiological effects of Eph receptor stimulation on cell adhesion and cytoskeletal
structures.

There are currently 14 related vertebrate members of the Eph receptor family, in addition to receptors
in Caenorhabditis elegans and Drosophila. Eph receptors are activated by ephrins. Ephrins are attached to
the plasma membrane either via a glycosylphosphatidylinositol linkage (A class) or a transmembrane
sequence (B class). Eph receptors are also divided into A and B classes corresponding to their ligand
binding specificities and phylogenetic relationships. Class A receptors generally bind A class ephrins,
whereas B class ephrins stimulate B class receptors. However, EphA4 is an exception in that it binds and
responds to B as well as A class ephrins.

The group that includes receptors interacting preferentially with ephrin A proteins is called EphA
and includes EphA1 (also known as Eph and Esk), EphA2 (also known as Eck, Myk2, Sek2), EphA3 (also
known as Cek4, Mek4, Hek, Tyro4, Hek4), EphA4 (also known as Sek, Sek1, Cek8, Hek8, Tyro1), EphA5
(also known as Ehk1, Bsk, Cek7, Hek7, and Rek7), EphA6 (also known as Ehk2 and Hek12), EphA7 (also
known as Mdk1, Hek11, Ehk3, Ebk, Cek11), and EphA8 (also known as Eek, Hek3). The group that includes
receptors interacting preferentially with ephrin B proteins is called EphB and includes EphB1 (also known
as Elk, Cek6, Net, Hek6), EphB2 (also known as Cek5, Nuk, Erk, Qek5, Tyro5, Sek3, Hek5, Drt), EphB3
(also known as Cek10, Hek2, Mdk5, Tyro6, and Sek4), EphB4 (also known as Htk, Myk1, Tyro11, Mdk2),
EphB5 (also known as Cek9, Hek9), and EphB6 (also known as Mep).

"Ephrin" refers to a class of ligands which are anchored to the cell membrane through a
transmembrane domain, and bind to the extracellular domain of an Eph receptor, facilitating dimerization
and autophosphorylation of the receptor and autophosphorylation of the ligand. The ephrin-A ligands
(GPI-anchored ligands) are ephrin-A1 (also known as B61, LERK1, EFL-1), ephrin-A2 (also known as
LERK6, Elf1, mCek7-L, cElf1), ephrin-A3 (also known as LERK3, Ehk1-L, and EFL-2), ephrin-A4 (also
known as LERK4, EFL-4, mLERK4), and ephrin-A5 (also known as AL1, LERK7, EFL-5, mAL1, [rLERK7],
RAGS). The ephrin-B ligands (transmembrane ligands) are ephrin-B1 (also known as LERK2, ELK-L, EFL-3,
Cek5-L, Stra1, [LERK2]), ephrin-B2 (also known as LERK5, HTK-L, NLERK1, Elf2, Htk-L), and ephrin-B3
(also known as LERK8, ELK-L3, NLERK2, EFL-6, Elf3, [rELK-L3]).

RTKs may be derivable from a variety of sources, including viruses, bacteria, fungi, plants and
animals. In a preferred embodiment an RTK is derivable from a mammal, for example, a human.

An RTK in the present invention may be a wild type enzyme, or part thereof, or a mutant, variant 
or homolog of such an enzyme. 

The term "wild type" refers to a polypeptide having a primary amino acid sequence which is 
identical with the native enzyme (for example, the human enzyme). 

The term "mutant" refers to a polypeptide having a primary amino acid sequence which differs
from the wild type sequence by one or more amino acid additions, substitutions or deletions. Preferably,
the mutant has at least 90% sequence identity with the wild type sequence. Preferably, the mutant has 20
mutations or less over the whole wild-type sequence. More preferably the mutant has 10 mutations or less,
most preferably 5 mutations or less over the whole wild-type sequence.
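
The mutation-count preferences above can be illustrated with a short sketch. It assumes the two sequences have already been aligned to equal length, with "-" marking an added or deleted position, and it counts every differing aligned position as one mutation. The function names and tier labels are hypothetical.

```python
def count_mutations(wild_type: str, candidate: str) -> int:
    """Count aligned positions at which the candidate differs from the wild type."""
    if len(wild_type) != len(candidate):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for w, c in zip(wild_type, candidate) if w != c)


def mutant_tier(wild_type: str, candidate: str) -> str:
    """Place a candidate in the 5 / 10 / 20 mutation tiers named in the text."""
    n = count_mutations(wild_type, candidate)
    if n == 0:
        return "wild type"
    if n <= 5:
        return "most preferred (5 or less)"
    if n <= 10:
        return "more preferred (10 or less)"
    if n <= 20:
        return "preferred (20 or less)"
    return "outside the preferred ranges"
```

The 90% identity preference is a separate criterion and is not checked by this sketch.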

The term "variant" refers to a naturally occurring polypeptide which differs from a wild-type
sequence. A variant may be found within the same species (i.e. if there is more than one isoform of the
enzyme) or may be found within a different species. Preferably the variant has at least 90% sequence
identity with the wild type sequence. Preferably, the variant has 20 mutations or less over the whole
wild-type sequence. More preferably, the variant has 10 mutations or less, most preferably 5 mutations or less
over the whole wild-type sequence.

The term "part" indicates that the polypeptide comprises a fraction of the wild-type amino acid
sequence. It may comprise one or more large contiguous sections of sequence or a plurality of small
sections. The "part" may comprise a binding pocket as described herein. The polypeptide may also
comprise other elements of sequence, for example, it may be a fusion protein with another protein (such as
one which aids isolation or crystallisation of the polypeptide). Preferably the polypeptide comprises at
least 50%, more preferably at least 65%, most preferably at least 80% of the wild-type sequence.

The term "homolog" means a polypeptide having a degree of homology with the wild-type amino
acid sequence. The term "homology" refers to a degree of complementarity. There may be partial
homology or complete homology. In an embodiment of the invention an RTK is substantially homologous
to a wild type enzyme. A sequence that is "substantially homologous" refers to a partially complementary
sequence that at least partially inhibits an identical sequence from hybridizing to a target nucleic acid.
Inhibition of hybridization of a completely complementary sequence to the target sequence may be
examined using a hybridization assay (e.g. Southern or northern blot, solution hybridization, etc.) under
conditions of reduced stringency. A sequence that is substantially homologous or a hybridization probe
will compete for and inhibit the binding of a completely homologous sequence to the target sequence
under conditions of reduced stringency. However, conditions of reduced stringency are not such that
non-specific binding is permitted, as reduced stringency conditions require that the binding of two sequences to
one another be a specific (i.e., a selective) interaction. The absence of non-specific binding may be tested
using a second target sequence which lacks even a partial degree of complementarity (e.g., less than about
30% homology or identity). The substantially homologous sequence or probe will not hybridize to the
second non-complementary target sequence in the absence of non-specific binding.

A sequence of an RTK may have at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99%
identity. The phrase "percent identity" or "% identity" refers to the percentage of sequence similarity found
in a comparison of two or more amino acid sequences. Percent identity can be determined electronically
using conventional programs, e.g., by using the MEGALIGN program (LASERGENE software package,
DNASTAR). The MEGALIGN program can create alignments between two or more amino acid sequences
according to different methods, e.g., the Clustal Method (Higgins, D. G. and P. M. Sharp (1988) Gene
73:237-244). Gaps of low or of no homology between the two amino acid sequences are not included in
determining percentage similarity.
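
As a minimal sketch of the gap rule just stated, the function below computes percent identity over two sequences that have already been aligned, skipping any column in which either sequence has a gap. It is not the MEGALIGN or Clustal procedure; the function name, the "-" gap symbol, and the sample strings in the usage note are assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical residues over aligned columns that contain no gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # gapped columns are excluded from the comparison
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared
```

For example, `percent_identity("ACDE-FG", "ACDQWFG")` compares six ungapped columns, five of which match.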

In the present context, a homologous sequence is taken to include an amino acid sequence which
may have at least 75, 85 or 90% identity, preferably at least 95 or 98% identity to the wild-type sequence.
The homologs will comprise the same sites (for example, binding pocket) as the subject amino acid
sequence.

A sequence may have deletions, insertions or substitutions of amino acid residues which produce
a silent change and result in a functionally equivalent enzyme. Deliberate amino acid substitutions may be
made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the
amphipathic nature of the residues as long as the secondary binding activity of the substance is retained.
For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged
amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar
hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine,
threonine, phenylalanine, and tyrosine.
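
The groupings named in this paragraph can be written down as sets so that a proposed substitution can be checked for whether it stays within one group. This is a sketch only: the three-letter codes, the group labels, and the function names are illustrative choices, and residues not listed in the paragraph (for example histidine) are treated as belonging to no group.

```python
from typing import Optional

# Groups exactly as listed in the paragraph above.
GROUPS = {
    "negatively charged": {"Asp", "Glu"},
    "positively charged": {"Lys", "Arg"},
    "uncharged polar head groups": {
        "Leu", "Ile", "Val", "Gly", "Ala", "Asn", "Gln", "Ser", "Thr", "Phe", "Tyr",
    },
}


def group_of(residue: str) -> Optional[str]:
    """Return the name of the listed group containing the residue, if any."""
    for name, members in GROUPS.items():
        if residue in members:
            return name
    return None


def is_within_group(original: str, replacement: str) -> bool:
    """True when both residues fall in the same listed group."""
    group = group_of(original)
    return group is not None and group == group_of(replacement)
```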

The polypeptide may also have a homologous substitution (substitution and replacement are both
used herein to mean the interchange of an existing amino acid residue with an alternative residue), i.e.
like-for-like substitution such as basic for basic, acidic for acidic, polar for polar etc. Non-homologous
substitution may also occur, i.e. from one class of residue to another, or alternatively involving the inclusion
of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine
(hereinafter referred to as B), norleucine ornithine (hereinafter referred to as O), pyridylalanine,
thienylalanine, naphthylalanine and phenylglycine.

BINDING POCKET

"Binding pocket" refers to a region or site of an RTK or molecular complex thereof that, as a result
of its shape, associates with another region of the RTK or with a ligand or a part thereof. A binding pocket
may regulate the kinase domain of the RTK. A binding pocket may be involved in maintaining an
autoinhibited state or active state of an RTK. For example, a binding pocket may comprise part of a
juxtamembrane region of an RTK that associates with a kinase domain of the RTK (e.g. strand segment
Ex1), a site formed by interacting amino acid residues in the juxtamembrane region (e.g. switch region 2),
a site formed by interacting amino acid residues in the juxtamembrane region and kinase domain (switch
region 1), or a region responsible for binding a ligand.

The invention contemplates a binding pocket of an RTK in an autoinhibited state or an active
state.

A "ligand" refers to a compound or entity that associates with a binding pocket, including
nucleotides or analogues or parts thereof, substrates or analogues or parts thereof, or modulators of RTKs,
including inhibitors. A ligand may be designed rationally by using a model according to the present
invention.

In an aspect of the invention a binding pocket comprises one or more of the residues involved in
coordination of a nucleotide or analog thereof, in particular the amino acid residues involved in
coordinating the sugar and phosphate groups of the nucleotide.

In an aspect of the invention the binding pocket comprises phosphoregulatory sites of a
juxtamembrane region or kinase domain. Phosphoregulatory sites are sites that are autophosphorylated
following ligand binding of an RTK and that potentiate binding of cytoplasmic signalling targets such as
SH2 or SH3 domain signalling proteins. In a specific aspect the binding pocket comprises invariant
tyrosine residues (e.g. tyrosines 596 and 602 of EphA4; tyrosines 604 and 610 of EphB2) within a
conserved amino acid sequence (e.g. YIDPFTYEPD in EphB2) in the juxtamembrane region.

A binding pocket may comprise one or more of the amino acid residues for an Eph receptor
crystal identified as numbers 1 through 49 shown in Table 2. In an aspect the binding pocket comprises the
atomic contacts of atomic interactions 1 to 24 (juxtamembrane-kinase interactions) or interactions 25 to 49
(juxtamembrane-juxtamembrane interactions) identified in Table 2. In a preferred embodiment the binding
pocket comprises atomic interactions or atomic contacts 27, 28, 29, and 38; 39 and 40; or 9, 13, 14, 16, 18,
19, 32, 39, 40, and 42 in Table 2. In an aspect of the invention the binding pocket comprises all of the
amino acid residues identified in Table 2.

A binding pocket may be involved in coordination of a ligand or substrate. For example, a binding
pocket may be involved in coordination of a nucleotide, or part or analog thereof. Therefore, a binding
pocket may comprise two or more of the amino acid residues Phe 709, Met 710, Glu 708, Thr 707, Leu
761, Gly 713, (Lys 661), Ala 659, Ile 691, and (Ser 771) of an RTK structure as described herein, that are
capable of associating with or coordinating a nucleotide as described herein.

The term "binding pocket" (BP) also includes a homolog of the binding pocket or a portion
thereof. As used herein, the term "homolog" in reference to a binding pocket refers to a binding pocket or a
portion thereof which may have deletions, insertions or substitutions of amino acid residues as long as the
binding specificity is retained. In this regard, deliberate amino acid substitutions may be made on the basis
of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature
of the residues as long as the binding specificity of the binding pocket is retained.

As used herein, the term "portion thereof" means the structural coordinates corresponding to a
sufficient number of amino acid residues of a binding pocket (or homologs thereof) that are capable of
providing an autoinhibited or active state or for associating with a ligand. For example, the structural
coordinates provided in a crystal structure may contain a subset of the amino acid residues in a binding
pocket which may be useful in the modelling and design of compounds that bind to the binding pocket.

AUTOINHIBITED/ACTIVE STATE

An RTK or a binding pocket thereof may be in an autoinhibited state or active state. An
"autoinhibited state" refers to the state of an RTK or a binding pocket that results in disruption of the
activation segment of the kinase domain and of the effective coordination of bound nucleotide. The
autoinhibited state results in perturbed catalytic function of an RTK. An autoinhibited state typically
occurs in the absence of phosphorylation of the RTK.

An "active state" refers to the state of an RTK or a binding pocket that does not result in disruption
of the activation segment of the kinase domain or of the effective coordination of bound nucleotide. In the
active state the RTK is catalytically active and the juxtamembrane segment is free to bind to signalling
proteins such as SH2 domain containing proteins, including p120-RasGAP, phosphatidylinositol
3'-kinase, SHEP-1, Src family kinases, and the adapter protein Nck. An active state typically occurs in the
presence of phosphorylation of the RTK.

CRYSTAL

The invention provides crystal structures. As used herein, the term "crystal" or "crystalline"
means a structure (such as a three dimensional (3D) solid aggregate) in which the plane faces intersect at
definite angles and in which there is a regular structure (such as internal structure) of the constituent
chemical species. Thus, the term "crystal" can include any one of: a solid physical crystal form such as an
experimentally prepared crystal, a crystal structure derivable from the crystal (including secondary and/or
tertiary and/or quaternary structural elements), a 2D and/or 3D model based on the crystal structure, a
representation thereof such as a schematic representation thereof or a diagrammatic representation thereof, or
a data set thereof for a computer.

In one aspect, the crystal is usable in X-ray crystallography techniques. Here, the crystals used can
withstand exposure to X-ray beams used to produce the diffraction pattern data necessary to solve the X-ray
crystallographic structure. A crystal may be characterized as being capable of diffracting X-rays in a pattern
defined by one of the crystal forms depicted in Blundell et al., 1976, Protein Crystallography, Academic
Press.

A crystal of the invention is generally produced in a laboratory; that is, it is an isolated crystal
produced by an individual.

The invention contemplates a crystal comprising a binding pocket of the invention, in particular a 
binding pocket that regulates the kinase domain of the receptor tyrosine kinase. The binding pocket may be 
of an autoinhibited state RTK or an active RTK. 

In an aspect of the invention a crystal is provided that comprises the juxtamembrane region and
kinase domain of an RTK. In an embodiment the RTK is an Eph receptor, preferably an EphB receptor. In
a preferred embodiment the crystal comprises the juxtamembrane region and the catalytic domain (amino
acid residues 595 to 906) of EphB2. The juxtamembrane region and the catalytic domain may be in an
autoinhibited state.

A crystal of the invention may be characterized by one or more of the following characteristics:

(a) an N-terminal lobe for binding and coordinating ATP for transfer of a γ-phosphate to a
substrate, comprising a twisted five-strand β-sheet (denoted β1 to β5) and a single helix
αC; and optionally further characterized by (i) a flexible loop that interacts with the
adenine base, ribose sugar and the non-hydrolyzable phosphate groups of ATP, which
loop is formed by β-strands 1 and 2 and a connecting glycine-rich segment (g-loop), and
(ii) an invariant salt bridge between a lysine side chain in β-strand 3 and a glutamic acid
side chain in helix αC that coordinates the position of the β-phosphate of ATP; and

(b) a C-terminal lobe comprising two β-strands (β7 and β8) and a series of α-helices (αD to
αI), which is further characterized by (i) strands β7 and β8 in the cleft region between the
N- and C-terminal lobes, where they contribute side chains that participate in catalysis
and the binding of magnesium for the coordination of ATP phosphate groups, (ii) an
activation segment flanked by the sequence Asp-Phe-Gly of sub-domain VII and
Pro-Ile-Arg of sub-domain VIII, and (iii) a helix αI adjacent to helix αH.

A crystal of the invention comprising a juxtamembrane region of an RTK, in particular an Eph
receptor, more particularly an EphB receptor, most particularly an EphB2 receptor, may be characterized
as comprising a single-turn helix αA' (i.e. a 3/10 helix), and a four-turn helix αB' from the amino terminus
of an extended strand segment Ex1. The crystal may also comprise this juxtamembrane region in
association with interacting amino acid residues on the N- and C-terminal lobes of the RTK. (See Figures
2-4, and Table 2.)

A crystal of the invention may comprise a juxtamembrane strand segment Ex1 comprising amino
acid residues Lys 602 to Ile 605, which strand extends along the cleft region between the N- and C-terminal
lobes of an RTK. The strand is stabilized by hydrogen bonding interactions involving the amide group of
Phe 604 with the carbonyl group of Met 748 and the Gln 684 side chain with the backbone amide and
carbonyl groups of Ile 605.

In a further aspect of the invention a crystal is provided comprising a hydrophobic interface site
(referred to herein as switch region 1) comprising side chains of Met 748 and Tyr 750 of the C-terminal
kinase lobe; Phe 685 and Ile 681 from helix αC, and Pro 607 from the juxtamembrane helix αA', and the
phosphoregulatory site or residue Phe 604 which orients into the site.

A crystal of the invention may comprise helix αA' which is more particularly characterized by
one or more of the following characteristics:

(a) it is composed of a single rigid turn initiated by Asp 606 and Pro 607 and terminated by Thr 609;

(b) it is stabilized by the conformational rigidity of Pro 607 and the capping interactions involving
Asp 606 and Thr 609 with the free backbone amino group and carbonyl groups of Phe 608 and
Asp 606.

A crystal of the invention may comprise helix αB' which is more particularly characterized by
one or more of the following characteristics:

(a) it is initiated by an Asp Pro sequence (residues 612 and 613); and

(b) Asp 612 makes capping interactions with the backbone amino and side chain of Asn 614.

A crystal of the invention may comprise helices αA' and αB' of a juxtamembrane region of an
RTK and the portion of the N-terminal lobe of the kinase domain centering on helix αC of the RTK which
forms an interface with helices αA' and αB' and is further characterized as follows:

(a) hydrophobic side chains projecting from αA' and αB' include Pro 607, Phe 608, Pro 613,
Val 617, Phe 620 and Ala 621, which residues associate intimately with the side chains of Arg
673, Leu 676, and Ile 681 from helix αC and the side chains of Leu 693 and Val 696 from
β-strand 4;

(b) a hydrogen bond interaction (2.9 Å) between Asn 614 and Arg 672; and

(c) the small side chains at positions 616 (Ala), 677 (Ser) and 680 (Ser) facilitate the close
packing of helices αA', αB' and αC.

A crystal of the invention may comprise a hydrophobic interface site (also referred to herein as
"switch region 2") formed by association of helix αC, strand Ex1 and helices αA' and αB' of the
juxtamembrane region of an RTK. The interface is characterized as follows:

(a) projection of the side chain of the phosphoregulatory residue Tyr/Phe 610 onto the surface of
the site;

(b) composed of the side chains of Ile 605 from strand Ex1 and the side chains of Ala 616 and
Phe 620 from helix αB'; and

(c) an electrostatic environment dominated by Asp 606, Glu 611, Asp 612, Glu 615, and Glu
619.

A crystal of the invention may comprise the following amino acid residues:

(a) Arg 672, Phe 675, and Leu 676 from helix αC, Tyr 667 from the β3/αC linker
and Leu 663, Val 696, Thr 698, Val 703, and Ile 705; or

(b) Met 748, Tyr 750, Phe 685, Ile 681, Pro 607, and Phe 604; or

(c) Phe 709, Met 710, Glu 708, Thr 707, Leu 761, Gly 713, Ala 659, Ile 691, Lys 661, and Ser
771; or

(d) Asp 606, Pro 607, Thr 609, and Phe 608; or

(e) Asp 612, Pro 613, and Asn 614; or

(f) Pro 607, Phe 608, Pro 613, Val 617, Phe 620, Ala 621, Arg 673, Leu 676, Ile 681, Leu 693,
Val 696, Asn 614, Arg 672, Ala 616, Ser 677, and Ser 680; or

(g) Tyr/Phe 610, Ile 605, Ala 616, Phe 620, Asp 606, Glu 611, Asp 615, and Glu 619.

Preferably the atoms of the amino acid residues in (a) to (g) have the structural coordinates as set
out in Table 3.

In an embodiment, a crystal of an Eph receptor of the invention belongs to space group P2₁ or P1.
The term "space group" refers to the lattice and symmetry of the crystal. In a space group designation the
capital letter indicates the lattice type and the other symbols represent symmetry operations that can be
carried out on the contents of the asymmetric unit without changing its appearance.

A crystal of the invention may comprise a unit cell having the following unit dimensions: a =
47.05 (±0.05) Å, b = 57.62 (±0.05) Å, c = 67.74 (±0.05) Å, or a = 47.86 (±0.05) Å, b = 98.09 (±0.05)
Å, c = 68.18 (±0.05) Å. The term "unit cell" refers to the smallest and simplest volume element (i.e.
parallelepiped-shaped block) of a crystal that is completely representative of the unit of pattern of the
crystal. The unit cell axial lengths are represented by a, b, and c. Those of skill in the art understand that a
set of atomic coordinates determined by X-ray crystallography is not without standard error.
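
The stated tolerances can be applied as a simple range check. The sketch below is illustrative only: it assumes the two sets of axial lengths read as (47.05, 57.62, 67.74) and (47.86, 98.09, 68.18) Å with a tolerance of 0.05 Å on each axis, and the names `REPORTED_CELLS` and `matches_reported_cell` are hypothetical. Cell angles are not given in this passage and are not checked.

```python
# Axial lengths (a, b, c) in angstroms, as read from the paragraph above.
REPORTED_CELLS = [
    (47.05, 57.62, 67.74),
    (47.86, 98.09, 68.18),
]
TOLERANCE = 0.05  # angstroms, per axis


def matches_reported_cell(a: float, b: float, c: float, tol: float = TOLERANCE) -> bool:
    """True if (a, b, c) lies within the tolerance of either reported unit cell."""
    measured = (a, b, c)
    slack = 1e-9  # guards against floating-point rounding at the boundary
    return any(
        all(abs(m - r) <= tol + slack for m, r in zip(measured, cell))
        for cell in REPORTED_CELLS
    )
```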

In a preferred embodiment, a crystal of the invention has the structural coordinates as shown in
Table 3. As used herein, the term "structural coordinates" refers to a set of values that define the position
of one or more amino acid residues with reference to a system of axes. The term refers to a data set that
defines the three dimensional structure of a molecule or molecules (e.g. Cartesian coordinates, temperature
factors, and occupancies). Structural coordinates can be slightly modified and still render nearly identical
three dimensional structures. A measure of a unique set of structural coordinates is the root-mean-square
deviation of the resulting structure. Structural coordinates that render three dimensional structures (in
particular a three dimensional structure of a ligand binding pocket) that deviate from one another by a
root-mean-square deviation of less than 5 Å, 4 Å, 3 Å, 2 Å, 1.5 Å, 1.0 Å, or 0.5 Å may be viewed by a person
of ordinary skill in the art as very similar.
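
A root-mean-square deviation between two coordinate sets can be computed as sketched below. The sketch assumes the two sets list the same atoms in the same order and have already been superimposed; it performs no fitting of its own. The function names are hypothetical, and the cutoffs are the values listed in the paragraph above.

```python
import math
from typing import Optional, Sequence, Tuple

Point = Tuple[float, float, float]

# Cutoffs in angstroms, as listed in the text.
CUTOFFS = (5.0, 4.0, 3.0, 2.0, 1.5, 1.0, 0.5)


def rmsd(coords_a: Sequence[Point], coords_b: Sequence[Point]) -> float:
    """Root-mean-square deviation over paired atoms (no superposition performed)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def tightest_cutoff(value: float) -> Optional[float]:
    """Smallest listed cutoff that the RMSD is strictly below, or None."""
    passed = [c for c in CUTOFFS if value < c]
    return min(passed) if passed else None
```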

Variations in structural coordinates may be generated because of mathematical manipulations of
the structural coordinates of an RTK described herein. For example, the structural coordinates
of Table 3 may be manipulated by crystallographic permutations of the structural coordinates,
fractionalization of the structural coordinates, integer additions or subtractions to sets of the structural
coordinates, inversion of the structural coordinates or any combination of the above.
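
Two of the manipulations named above, integer additions and inversion, leave every interatomic distance unchanged, which is one way to see that the manipulated coordinates describe a structure with the same internal geometry. The sketch below checks this on hypothetical coordinates; the function names are illustrative, and inversion is taken here to mean negating every coordinate.

```python
import math
from itertools import combinations
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def pairwise_distances(coords: Sequence[Point]) -> List[float]:
    """All interatomic distances, in a fixed pair order."""
    return [math.dist(p, q) for p, q in combinations(coords, 2)]


def add_integers(coords: Sequence[Point], shift: Tuple[int, int, int]) -> List[Point]:
    """Add the same integer offset to every atom (a rigid translation)."""
    return [(x + shift[0], y + shift[1], z + shift[2]) for x, y, z in coords]


def invert(coords: Sequence[Point]) -> List[Point]:
    """Negate every coordinate (inversion through the origin)."""
    return [(-x, -y, -z) for x, y, z in coords]


# Hypothetical coordinates for illustration; not taken from Table 3.
original = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (1.0, 2.0, 2.0)]
manipulated = invert(add_integers(original, (10, -5, 2)))
```

Comparing `pairwise_distances(original)` with `pairwise_distances(manipulated)` gives the same values up to floating-point rounding.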

Variations in the crystal structure due to mutations, additions, substitutions, and/or deletions of
the amino acids, or other changes in any of the components that make up the crystal may also account for
modifications in structural coordinates. If such modifications are within an acceptable standard error as
compared to the original structural coordinates, the resulting structure may be the same. Therefore, a ligand
that bound to a binding pocket of an RTK, in particular an Eph receptor, would also be expected to bind to
another binding pocket whose structural coordinates defined a shape that fell within the acceptable error.
Such modified structures of an RTK or a binding pocket thereof are also within the scope of the invention.

Various computational analyses may be used to determine whether a molecule or the binding
pocket thereof is sufficiently similar to all or parts of an RTK or a binding pocket thereof. Such analyses
may be carried out using conventional software applications and methods as described herein.

A crystal of the invention may also be specifically characterised by the parameters, diffraction 
statistics and/or refinement statistics set out in Table 1.

With reference to a crystal of the present invention, residues in a binding pocket may be defined
by their spatial proximity to a ligand in the crystal structure. For example, a binding pocket may be
defined by its proximity to a nucleotide, substrate molecule, or modulator.
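
One way to apply a proximity definition of this kind is to collect every residue that has at least one atom within a chosen distance of any ligand atom. The sketch below is an assumption-laden illustration: the 4.0 Å default cutoff is not specified in this document, the function name is hypothetical, and the input layout (residue label paired with a coordinate) is chosen only for convenience.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def residues_near_ligand(
    protein_atoms: Sequence[Tuple[str, Point]],
    ligand_atoms: Sequence[Point],
    cutoff: float = 4.0,  # angstroms; an assumed value, not taken from the text
) -> List[str]:
    """Residue labels having at least one atom within `cutoff` of a ligand atom."""
    near = set()
    for label, position in protein_atoms:
        if any(math.dist(position, lig) <= cutoff for lig in ligand_atoms):
            near.add(label)
    return sorted(near)
```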

A crystal of the invention may comprise a binding pocket that is involved in coordination of a
nucleotide, or part or analog thereof. Therefore, a crystal may comprise a binding pocket comprising two
or more of the amino acid residues Phe 709, Met 710, Glu 708, Thr 707, Leu 761, Gly 713, (Lys 661), Ala
659, Ile 691, and (Ser 771) of an RTK structure as described herein, that are capable of associating with or
coordinating a nucleotide as described herein.

A crystal or secondary or three-dimensional structure of a binding pocket of an RTK, in particular
an EphB2 receptor, may be specifically defined by one or more of the atomic contacts of the atomic
interactions identified in Table 2. The atomic interactions in Table 2 are defined therein by an atomic
contact (more preferably, a specific atom of an amino acid residue where indicated) on the juxtamembrane
region, and an atomic contact (more preferably, a specific atom of an amino acid residue where indicated)
on the kinase domain, juxtamembrane region, or ligand. In certain embodiments, a crystal of the invention
comprises the atomic contacts of atomic interactions 1 to 24 (juxtamembrane-kinase interactions) or
atomic interactions 25 to 49 (juxtamembrane-juxtamembrane interactions) identified in Table 2. In certain
particular embodiments a crystal is provided comprising the atomic contacts of atomic interactions 27, 28,
29, and 38; 39 and 40; or 9, 13, 14, 16, 18, 19, 32, 39, 40, and 42.

Preferably, a crystal is defined by the atoms of the atomic contacts in the binding pocket having 
the structural coordinates for the atoms listed in Table 3.

A crystal of the invention includes a binding pocket in association with one or more moieties, 
including heavy-metal atoms, i.e. a derivative crystal, or one or more ligands or molecules, i.e. a co-crystal.

The term "associate", "association" or "associating" refers to a condition of proximity between a 
moiety (i.e. chemical entity or compound or portions or fragments thereof), and a binding pocket. The 
association may be non-covalent, i.e. where the juxtaposition is energetically favored by, for example,
hydrogen-bonding, van der Waals, or electrostatic or hydrophobic interactions, or it may be covalent. 

The term "heavy-metal atom" refers to an atom that can be used to solve an x-ray
crystallography phase problem, including but not limited to a transition element, a lanthanide metal, or an
actinide metal. Lanthanide metals include elements with atomic numbers between 57 and 71, inclusive.
Actinide metals include elements with atomic numbers between 89 and 103, inclusive.

Multiwavelength anomalous diffraction (MAD) phasing may be used to solve protein structures 
using selenomethionyl (SeMet) proteins. Therefore, a complex of the invention may comprise a crystalline 
binding pocket with selenium on the methionine residues of the protein. 

A crystal may comprise a complex between a binding pocket and one or more ligands or 
molecules. In other words, the binding pocket may be associated with one or more ligands or molecules in
the crystal. The ligand may be any compound that is capable of stably and specifically associating with the
binding pocket. A ligand may, for example, be a modulator of an Eph receptor, or a nucleotide or substrate 
or analogue thereof. 

In an embodiment of the invention, a binding pocket is in association with a cofactor in the 
crystal. A "cofactor" refers to a molecule required for RTK enzyme activity and/or stability. For example,
the cofactor may be a metal ion, including magnesium and other similar atoms or metals. 

In an embodiment, a crystal of the invention comprises a complex between a binding pocket, and 
a nucleotide or analogue thereof and/or a substrate or analogue thereof. A "nucleotide" includes ATP,
ADP, AMP, or analogues thereof, for example, β,γ-imidoadenosine-5'-triphosphate (AMP-PNP), STI-571,
and quercetin. A substrate may be, for example, a signalling protein, or another portion of the same RTK
(e.g. juxtamembrane-kinase domain complex). An analog of a nucleotide or substrate is one which mimics
the nucleotide or substrate molecule, binding in the binding pocket, but which is incapable of taking part
(or has a significantly reduced capacity to take part) in a kinase reaction.
Therefore, the present invention also provides: 

(a) a crystal comprising a binding pocket of an RTK and a nucleotide or analogue thereof; 

(b) a crystal comprising a binding pocket of an RTK and a substrate or analogue thereof; 

(c) a crystal comprising a binding pocket of an RTK and a nucleotide or analogue thereof, and a 
substrate or analogue thereof.

A complex may comprise one or more of the intermolecular interactions identified in Table 2. A 
structure of a complex of the invention may be defined by selected intermolecular contacts, preferably the 
structural coordinates of the intermolecular contacts as defined in Table 3. 

A crystal of the invention may enable the determination of structural data for a ligand. In order to
be able to derive structural data for a ligand, it is necessary for the molecule to have sufficiently strong
electron density to enable a model of the molecule to be built using standard techniques. For example,
there should be sufficient electron density to allow a model to be built using XTALVIEW (McRee, 1992, J.
Mol. Graphics 10:44-46).

Illustrations of particular crystals of the invention are shown in Figures 2, 3, and 4. 
METHOD OF MAKING A CRYSTAL

The present invention also provides a method of making a crystal according to the invention. The 
crystal may be formed from an aqueous solution comprising a purified polypeptide comprising an RTK, in 
particular an Eph receptor including a variant, part, homolog, or fragment thereof (e.g. a binding pocket). 
A method may utilize a purified polypeptide comprising a binding pocket to form a crystal. A method may 
utilize a purified polypeptide comprising a juxtamembrane region and kinase domain of an RTK, in
particular an Eph receptor, preferably an EphB receptor, or more preferably an EphB2 receptor. 

The term "purified" in reference to a polypeptide does not require absolute purity such as a
homogeneous preparation; rather, it represents an indication that the polypeptide is relatively purer than in
the natural environment. Generally, a purified polypeptide is substantially free of other proteins, lipids,
carbohydrates, or other materials with which it is naturally associated, preferably at a functionally
significant level, for example at least 85% pure, more preferably at least 95% pure, most preferably at least
99% pure. A skilled artisan can purify a polypeptide using standard techniques for protein
purification. A substantially pure polypeptide will yield a single major band on a non-reducing
polyacrylamide gel. Purity of the polypeptide can also be determined by amino-terminal amino acid
sequence analysis.

A polypeptide used in the method may be chemically synthesized in whole or in part using
techniques that are well-known in the art. Alternatively, methods are well known to the skilled artisan to
construct expression vectors containing a native or mutated RTK coding sequence and appropriate
transcriptional/translational control signals. These methods include in vitro recombinant DNA techniques,
synthetic techniques, and in vivo recombination/genetic recombination. See for example the techniques
described in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor
Laboratory Press (1989)), and other laboratory textbooks. (See also Sarker et al., Glycoconjugate J. 7:380,
1990; Sarker et al., Proc. Natl. Acad. Sci. USA 88:234-238, 1991; Sarker et al., Glycoconjugate J. 11:204-209,
1994; Hull et al., Biochem. Biophys. Res. Commun. 176:608, 1991; and Pownall et al., Genomics 12:699-704,
1992).

Crystals may be grown from an aqueous solution containing the purified polypeptide by a variety
of conventional processes. These processes include batch, liquid bridge, dialysis, vapor diffusion, and
hanging drop methods. (See for example, McPherson, 1982, John Wiley, New York; McPherson, 1990,
Eur. J. Biochem. 189:1-23; Weber, 1991, Adv. Protein Chem. 41:1-36). Generally, native crystals of the
invention are grown by adding precipitants to the concentrated solution of the polypeptide. The
precipitants are added at a concentration just below that necessary to precipitate the protein. Water is
removed by controlled evaporation to produce precipitating conditions, which are maintained until crystal
growth ceases.

Derivative crystals of the invention can be obtained by soaking native crystals in a solution
containing salts of heavy metal atoms. A complex of the invention can be obtained by soaking a native
crystal in a solution containing a compound that binds the polypeptide, or it can be obtained by
co-crystallizing the polypeptide in the presence of one or more compounds. In order to obtain co-crystals
with a compound which binds deep within the tertiary structure of the polypeptide, it is necessary to use the
second method.

In a preferred embodiment, the polypeptide is co-crystallised with a compound which stabilises 
the polypeptide (e.g. AMP-PNP). 

Once the crystal is grown, it can be placed in a glass capillary tube and mounted onto a holding
device connected to an X-ray generator and an X-ray detection device. Collection of X-ray diffraction
patterns is well documented by those skilled in the art (see for example, Ducruix and Giegé, 1992, IRL
Press, Oxford, England). A beam of X-rays enters the crystal and diffracts from the crystal. An X-ray
detection device can be utilized to record the diffraction patterns emanating from the crystal. Suitable
devices include the Mar 345 imaging plate detector system with an RU200 rotating anode generator.

Multiwavelength anomalous diffraction (MAD) phasing using selenomethionyl (SeMet) proteins
may be used to determine the structure of a crystal of the invention. Thus, the invention contemplates a
method for determining a crystal structure of the invention using a selenomethionyl derivative of an RTK,
including a variant, part, homolog or fragment thereof.

Methods for obtaining the three dimensional structure of the crystalline form of a molecule or
complex are described herein and known to those skilled in the art (see Ducruix and Giegé, 1992, IRL
Press, Oxford, England). Generally, the x-ray crystal structure is given by the diffraction patterns. Each
diffraction pattern reflection is characterized as a vector, and the data collected at this stage determine the
amplitude of each vector. The phases of the vectors may be determined by the isomorphous replacement
method, where heavy atoms soaked into the crystal are used as reference points in the X-ray analysis (see
for example, Otwinowski, 1991, Daresbury, United Kingdom, 80-86). The phases of the vectors may also
be determined by molecular replacement (see for example, Naraza, 1994, Proteins 11:281-296). The
amplitudes and phases of vectors from the crystalline form determined in accordance with these methods
can be used to analyze other related crystalline polypeptides.

The unit cell dimensions and symmetry, and vector amplitude and phase information can be used
in a Fourier transform function to calculate the electron density in the unit cell, i.e. to generate an
experimental electron density map. This may be accomplished using the PHASES package (Furey, 1990).
Amino acid sequence structures are fit to the experimental electron density map (i.e. model building) using
computer programs (e.g. Jones, T.A. et al., Acta Crystallogr. A47, 100-119, 1991). This structure can also be
used to calculate a theoretical electron density map. The theoretical and experimental electron density
maps can be compared and the agreement between the maps can be described by a parameter referred to as
the R-factor. A high degree of overlap in the maps is represented by a low R-factor value. The R-factor can be
minimized by using computer programs that refine the structure to achieve agreement between the
theoretical and observed electron density maps. For example, the X-PLOR program, developed by Brunger
(1992, Nature 355:472-475), can be used for model refinement.
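
By way of illustration only, an R-factor of the kind referred to above may be sketched as follows. The text does not state a formula; the sketch assumes the conventional crystallographic definition, R = sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|), computed over structure-factor amplitudes. The function name and the amplitude values are hypothetical.

```python
def r_factor(f_obs, f_calc):
    """Conventional crystallographic R-factor over paired amplitudes.

    Assumes the standard definition; scaling between observed and
    calculated amplitudes is taken to have been applied already.
    """
    if len(f_obs) != len(f_calc):
        raise ValueError("observed and calculated lists must have equal length")
    denominator = sum(abs(f) for f in f_obs)
    if denominator == 0:
        raise ValueError("observed amplitudes sum to zero")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    return numerator / denominator

# Hypothetical amplitudes for three reflections.
observed = [100.0, 50.0, 25.0]
calculated = [90.0, 55.0, 25.0]
print(f"R = {r_factor(observed, calculated):.4f}")
```

A value of zero corresponds to perfect agreement between the observed and calculated amplitudes, which is consistent with the statement that a high degree of overlap is represented by a low R-factor.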

A three dimensional structure of the molecule or complex may be described by atoms that fit the 
theoretical electron density characterized by a minimum R value. Files can be created for the structure that
define each atom by coordinates in three dimensions.

MODEL

A crystal structure of the present invention may be used to make a model of a binding pocket of 
an RTK, preferably an Eph receptor, more preferably an EphB receptor. A model may, for example, be a 
structural model or a computer model. A model may represent the secondary, tertiary and/or quaternary 
structure of the binding pocket. The model itself may be in two or three dimensions. It is possible for a 
computer model to be in three dimensions despite the constraints imposed by a conventional computer
screen, if it is possible to scroll along at least a pair of axes, causing "rotation" of the image. 

As used herein, the term "modelling" includes the quantitative and qualitative analysis of 
molecular structure and/or function based on atomic structural information and interaction models. The 
term "modelling" includes conventional numeric-based molecular dynamic and energy minimization 
models, interactive computer graphic models, modified molecular mechanics models, distance geometry
and other structure-based constraint models. 

Preferably, modelling is performed using a computer and may be further optimized using known 
methods. This is called modelling optimisation. 

An integral step to an approach of the invention for designing modulators (e.g. inhibitors) of a
subject receptor involves construction of computer graphics models of the binding pocket of a receptor
which can be used to design pharmacophores by rational drug design. For instance, for an inhibitor to
interact optimally with the subject binding pocket, it will generally be desirable that it have a shape which
is at least partly complementary to that of a particular binding pocket of the receptor, as for example those
binding pockets of the receptor which are involved in recognition of a ligand, regulating the kinase
domain, or regulating signal transduction. Additionally, other factors, including electrostatic interactions,
hydrogen bonding, hydrophobic interactions, desolvation effects, and cooperative motions of ligand and
receptor, all influence the binding effect and should be taken into account in attempts to design bioactive
modulators (e.g. inhibitors).

As described herein, a computer-generated molecular model of the subject receptors can be
created. In preferred embodiments, at least the Cα-carbon positions of the RTK sequence of interest are
mapped to a particular coordinate pattern, such as the coordinates for a binding pocket of an EphB2 shown
in Table 3, by homology modeling, and the structure of the protein and velocities of each atom are
calculated at a simulation temperature (T0) at which the docking simulation is to be determined. Typically,
such a protocol involves primarily the prediction of side-chain conformations in the modeled protein, while
assuming a main-chain trace taken from a tertiary structure such as provided in Table 3 and the Figures.
Computer programs for performing energy minimization routines are commonly used to generate
molecular models. For example, both the CHARMM (Brooks et al. (1983) J Comput Chem 4:187-217)
and AMBER (Weiner et al. (1981) J Comput. Chem. 106:765) algorithms handle all of the molecular
system setup, force field calculation, and analysis (see also, Eisenfield et al. (1991) Am J Physiol
261:C376-386; Lybrand (1991) J Pharm Belg 46:49-54; Froimowitz (1990) Biotechniques 8:640-644;
Burbam et al. (1990) Proteins 7:99-111; Pedersen (1985) Environ Health Perspect 61:185-190; and Kini
et al. (1991) J Biomol Struct Dyn 9:475-488). At the heart of these programs is a set of subroutines that,
given the position of every atom in the model, calculate the total potential energy of the system and the
force on each atom. These programs may utilize a starting set of atomic coordinates, such as the
coordinates provided in Table 3, the parameters for the various terms of the potential energy function, and
a description of the molecular topology (the covalent structure). Common features of such molecular
modeling methods include: provisions for handling hydrogen bonds and other constraint forces; the use of
periodic boundary conditions; and provisions for occasionally adjusting positions, velocities, or other
parameters in order to maintain or change temperature, pressure, volume, forces of constraint, or other
externally controlled conditions.

Most conventional energy minimization methods use the input data described above and the fact
that the potential energy function is an explicit, differentiable function of Cartesian coordinates, to
calculate the potential energy and its gradient (which gives the force on each atom) for any set of atomic
positions. This information can be used to generate a new set of coordinates in an effort to reduce the total
potential energy and, by repeating this process over and over, to optimize the molecular structure under a
given set of external conditions. These energy minimization methods are routinely applied to molecules
similar to the subject RTK proteins as well as nucleic acids, polymers and zeolites.
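
By way of illustration only, the iterative procedure described above (evaluate the potential energy and its gradient, then move the coordinates so as to reduce the energy) can be sketched as a steepest-descent loop. The sketch assumes a single harmonic bond between two atoms as the entire potential; the force constant, equilibrium length, step size, and function names are hypothetical and do not correspond to any CHARMM or AMBER parameter set.

```python
import math

def bond_energy_and_gradient(coords, k=100.0, r0=1.5):
    """Harmonic bond between atoms 0 and 1: E = 0.5 * k * (r - r0)**2.

    Returns the energy and the gradient on each atom; the force on an
    atom is the negative of its gradient.
    """
    a, b = coords
    d = tuple(bj - aj for aj, bj in zip(a, b))
    r = math.sqrt(sum(c * c for c in d))
    energy = 0.5 * k * (r - r0) ** 2
    scale = k * (r - r0) / r          # dE/dr divided by r
    grad_b = tuple(scale * c for c in d)
    grad_a = tuple(-g for g in grad_b)
    return energy, [grad_a, grad_b]

def steepest_descent(coords, step=0.001, n_steps=200):
    """Repeatedly move each atom a small step against its gradient."""
    coords = [tuple(c) for c in coords]
    for _ in range(n_steps):
        _, grad = bond_energy_and_gradient(coords)
        coords = [
            tuple(x - step * g for x, g in zip(atom, atom_grad))
            for atom, atom_grad in zip(coords, grad)
        ]
    return coords

start = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]   # bond stretched to 2.0
final = steepest_descent(start)
print(bond_energy_and_gradient(start)[0], bond_energy_and_gradient(final)[0])
```

Production force fields sum many such terms (bonds, angles, torsions, non-bonded interactions) and commonly use more efficient optimizers such as conjugate gradients, but the energy-gradient-update cycle is the same.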

In general, energy minimization methods can be carried out for a given temperature, Ti, which
may be different from the docking simulation temperature, T0. Upon energy minimization of the molecule
at Ti, coordinates and velocities of all the atoms in the system are computed. Additionally, the normal
modes of the system are calculated. It will be appreciated by those skilled in the art that each normal mode
is a collective, periodic motion, with all parts of the system moving in phase with each other, and that the
motion of the molecule is the superposition of all normal modes. For a given temperature, the mean square
amplitude of motion in a particular mode is inversely proportional to the effective force constant for that
mode, so that the motion of the molecule will often be dominated by the low frequency vibrations.
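
By way of illustration only, and assuming a classical harmonic treatment (an assumption not stated in the text), the relationship described above follows from equipartition of energy. The symbols are introduced here for the example: q_k is the displacement along mode k, kappa_k its effective force constant, mu_k its effective mass, omega_k its angular frequency, and k_B the Boltzmann constant.

```latex
\tfrac{1}{2}\,\kappa_k \langle q_k^{2} \rangle = \tfrac{1}{2}\,k_B T
\quad\Longrightarrow\quad
\langle q_k^{2} \rangle = \frac{k_B T}{\kappa_k} = \frac{k_B T}{\mu_k\,\omega_k^{2}}
```

Because the mean square amplitude varies as the inverse square of the frequency, the lowest-frequency modes contribute the largest displacements at a given temperature.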

After the molecular model has been energy minimized at Ti, the system is "heated" or "cooled" to
the simulation temperature, T0, by carrying out an equilibration run where the velocities of the atoms are
scaled in a step-wise manner until the desired temperature, T0, is reached. The system is further
equilibrated for a specified period of time until certain properties of the system, such as average kinetic
energy, remain constant. The coordinates and velocities of each atom are then obtained from the
equilibrated system.
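
By way of illustration only, step-wise velocity scaling toward a target temperature can be sketched as follows. The sketch assumes that the instantaneous temperature is obtained from the kinetic energy over 3N degrees of freedom, that masses and velocities are in units for which 0.5*m*v^2 is in kcal/mol, and that no dynamics are run between stages; the function names and the number of stages are hypothetical.

```python
import math

K_B = 0.0019872041  # Boltzmann constant in kcal/(mol*K), molar units

def temperature(masses, velocities):
    """Instantaneous temperature from kinetic energy (3N degrees of freedom)."""
    kinetic = 0.5 * sum(
        m * sum(v * v for v in vel) for m, vel in zip(masses, velocities)
    )
    return 2.0 * kinetic / (3 * len(masses) * K_B)

def rescale(velocities, t_current, t_target):
    """Scale all velocities by sqrt(T_target / T_current)."""
    factor = math.sqrt(t_target / t_current)
    return [tuple(factor * v for v in vel) for vel in velocities]

def scale_stepwise(masses, velocities, t_final, n_stages=5):
    """Move the temperature to t_final in equal increments.

    A real protocol would run a short dynamics segment after each
    rescaling; that step is omitted in this sketch.
    """
    t_start = temperature(masses, velocities)
    for stage in range(1, n_stages + 1):
        t_stage = t_start + (t_final - t_start) * stage / n_stages
        velocities = rescale(velocities, temperature(masses, velocities), t_stage)
    return velocities

# Hypothetical two-atom system.
masses = [12.0, 16.0]
velocities = [(1.0, 0.0, 0.0), (0.0, 0.5, 0.0)]
scaled = scale_stepwise(masses, velocities, t_final=300.0)
print(round(temperature(masses, scaled), 6))
```

Because the temperature depends on the square of the velocities, multiplying every velocity by the square root of the temperature ratio reaches the stage temperature exactly in this idealised setting.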

Further energy minimization routines can also be carried out. For example, a second class of
methods involves calculating approximate solutions to the constrained equations of motion (EOM) for the
protein. These methods use an iterative approach to solve for the Lagrange multipliers and, typically, only
need a few iterations if the corrections required are small. The most popular method of this type, SHAKE
(Ryckaert et al. (1977) J Comput Phys 23:327; and Van Gunsteren et al. (1977) Mol Phys 34:1311), is easy to
implement and scales as O(N) as the number of constraints increases. Therefore, the method is applicable
to macromolecules such as the RTK proteins of the present invention. An alternative method, RATTLE
(Andersen (1983) J Comput Phys 52:24), is based on the velocity version of the Verlet algorithm. Like
SHAKE, RATTLE is an iterative algorithm and can be used to energy minimize the model of the subject
protein.
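
By way of illustration only, the iterative Lagrange-multiplier correction used by SHAKE-type methods can be sketched for fixed-distance constraints. The sketch is a simplified rendering of the general scheme and is not taken from the cited references; the function name, tolerance, and coordinates are hypothetical.

```python
def shake(old_pos, new_pos, masses, constraints, tol=1e-10, max_iter=100):
    """Adjust unconstrained positions so that each (i, j, d) constraint
    |r_i - r_j| = d is satisfied to within `tol` (on squared distance).

    Corrections are applied along the pre-step bond vectors, weighted
    by inverse mass, and repeated until every constraint converges.
    """
    pos = [list(p) for p in new_pos]
    for _ in range(max_iter):
        converged = True
        for i, j, d in constraints:
            s = [a - b for a, b in zip(pos[i], pos[j])]
            diff = sum(c * c for c in s) - d * d
            if abs(diff) > tol:
                converged = False
                r_old = [a - b for a, b in zip(old_pos[i], old_pos[j])]
                dot = sum(a * b for a, b in zip(s, r_old))
                inv_mass = 1.0 / masses[i] + 1.0 / masses[j]
                g = diff / (2.0 * inv_mass * dot)   # multiplier estimate
                for k in range(3):
                    pos[i][k] -= g * r_old[k] / masses[i]
                    pos[j][k] += g * r_old[k] / masses[j]
        if converged:
            return [tuple(p) for p in pos]
    raise RuntimeError("constraints did not converge")

# Hypothetical diatomic: bond length 1.0 before the step, stretched after it.
old = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
unconstrained = [(0.0, 0.1, 0.0), (1.2, -0.1, 0.0)]
fixed = shake(old, unconstrained, masses=[1.0, 1.0], constraints=[(0, 1, 1.0)])
print(fixed)
```

Each pass costs a fixed amount of work per constraint, which is the basis of the linear scaling noted above; when the required corrections are small, only a few passes are needed.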

Overlays and superpositioning with a three dimensional model of a binding pocket of the
invention may be used for modelling optimisation. Additionally, alignment and/or modelling can be used as
a guide for the placement of mutations on a binding pocket to characterize the nature of the site in the
context of a cell.

The three dimensional structure of a new crystal may be modelled using molecular replacement.
The term "molecular replacement" refers to a method that involves generating a preliminary model of a
molecule or complex whose structural coordinates are unknown, by orienting and positioning a molecule
whose structural coordinates are known within the unit cell of the unknown crystal, so as best to account
for the observed diffraction pattern of the unknown crystal. Phases can then be calculated from this model
and combined with the observed amplitudes to give an approximate Fourier synthesis of the structure
whose coordinates are unknown. This, in turn, can be subjected to any of the several forms of refinement to
provide a final, accurate structure of the unknown crystal. See Lattman, E., "Use of the Rotation and
Translation Functions", in Methods in Enzymology, 115, pp. 55-77 (1985); M. G. Rossmann, ed., "The
Molecular Replacement Method", Int. Sci. Rev. Ser., No. 13, Gordon & Breach, New York, (1972).

Commonly used computer software packages for molecular replacement are X-PLOR (Brunger,
1992, Nature 355:472-475), AMoRE (Navaza, 1994, Acta Crystallogr. A50:157-163), the CCP4 package
(Collaborative Computational Project, Number 4, "The CCP4 Suite: Programs for Protein
Crystallography", Acta Cryst., Vol. D50, pp. 760-763, 1994), the MERLOT package (P.M.D. Fitzgerald,
J. Appl. Cryst., Vol. 21, pp. 273-278, 1988) and XTALVIEW (McRee et al. (1992) J. Mol. Graphics
10:44-46). It is preferable that the resulting structure not exhibit a root-mean-square deviation of more
than 3 Å.
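
By way of illustration only, a root-mean-square deviation between two sets of matched atomic coordinates can be sketched as follows. The sketch assumes the two structures have already been superimposed and that atoms are paired in the same order; no optimal superposition is performed. The function name and coordinates are hypothetical.

```python
import math

def rmsd(coords_a, coords_b):
    """Root-mean-square deviation over paired atoms, in the units of the
    input coordinates (angstroms for typical structure files)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))

# Hypothetical backbone atoms: the second set is shifted 1 angstrom along z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
candidate = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
value = rmsd(reference, candidate)
print(value, value <= 3.0)
```

The same calculation, restricted to backbone atoms, could be used to check a homolog against a stated deviation threshold.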

Molecular replacement computer programs generally involve the following steps: (1)
determining the number of molecules in the unit cell and defining the angles between them (self rotation
function); (2) rotating the known structure against diffraction data to define the orientation of the
molecules in the unit cell (rotation function); (3) translating the known structure in three dimensions to
correctly position the molecules in the unit cell (translation function); (4) determining the phases of the
X-ray diffraction data and calculating an R-factor from the reference data set and from the new
data, wherein an R-factor between 30-50% indicates that the orientations of the atoms in the unit cell have
been reasonably determined by the method; and (5) optionally, decreasing the R-factor to about 20% by
refining the new electron density map using iterative refinement techniques known to those skilled in the
art (refinement).

The quality of the model may be analysed using a program such as PROCHECK or 3D-Profiler
[Laskowski et al., 1993, J. Appl. Cryst. 26:283-291; Luthy R. et al., Nature 356:83-85, 1992; and Bowie,
J.U. et al., Science 253:164-170, 1991]. Once any irregularities have been resolved, the entire structure
may be further refined.

Other molecular modelling techniques may also be employed in accordance with this invention.
See, e.g., Cohen, N. C. et al., "Molecular Modelling Software and Methods for Medicinal Chemistry", J.
Med. Chem., 33, pp. 883-894 (1990). See also, Navia, M. A. and M. A. Murcko, "The Use of Structural
Information in Drug Design", Current Opinion in Structural Biology, 2, pp. 202-210 (1992).

Using the structural coordinates of a crystal provided by the invention, molecular modelling may be
used to determine the structural coordinates of a crystalline mutant or homolog of an RTK binding pocket.
By the same token, a crystal of the invention can be used to provide a model of a ligand. Modelling
techniques can then be used to approximate the three dimensional structure of ligand derivatives and other
components which may be able to mimic the atomic contacts between a ligand and binding pocket.
COMPUTER FORMAT OF CRYSTALS/MODELS 

Information derivable from a crystal of the present invention (for example the structural 
coordinates) and/or the model of the present invention may be provided in a computer-readable format.

Therefore, the invention provides a computer readable medium or a machine readable storage
medium which comprises the structural coordinates of a binding pocket of an RTK including all or any
parts thereof, or ligands including portions thereof. Such a storage medium encoded with
these data is capable of displaying on a computer screen or similar viewing device, a three-dimensional
graphical representation of a molecule or molecular complex which comprises such binding pockets or
similarly shaped homologous binding pockets. Thus, the invention also provides computerized
representations of the secondary or three-dimensional structures of a binding pocket of the invention,
including any electronic, magnetic, or electromagnetic storage forms of the data needed to define the
structures such that the data will be computer readable for purposes of display and/or manipulation.

In an aspect the invention provides a computer for producing a three-dimensional representation
of a molecule or molecular complex, wherein said molecule or molecular complex comprises a binding
pocket defined by structural coordinates of a binding pocket or structural coordinates of atoms of a ligand,
or a three-dimensional representation of a homolog of said molecule or molecular complex, wherein said
homolog comprises a binding pocket or ligand that has a root mean square deviation from the backbone
atoms of not more than 1.5 angstroms, wherein said computer comprises:

(a) a machine-readable data storage medium comprising a data storage material encoded with 
machine readable data wherein said data comprises the structural coordinates of a binding pocket 
of an RTK or a ligand according to Table 3; 

(b) a working memory for storing instructions for processing said machine-readable data;

(c) a central-processing unit coupled to said working memory and to said machine-readable data 
storage medium for processing said machine readable data into said three-dimensional 
representation; and 

(d) a display coupled to said central-processing unit for displaying said three-dimensional
representation.

The invention also provides a computer for determining at least a portion of the structural 
coordinates corresponding to an X-ray diffraction pattern of a molecule or molecular complex wherein said 
computer comprises: 

(a) a machine-readable data storage medium comprising a data storage material encoded with
machine readable data wherein said data comprises the structural coordinates according to
Table 3;

(b) a machine-readable data storage medium comprising a data storage material encoded with
machine readable data wherein said data comprises an X-ray diffraction pattern of said
molecule or molecular complex;

(c) a working memory for storing instructions for processing said machine-readable data of (a)
and (b);

(d) a central-processing unit coupled to said working memory and to said machine-readable data
storage medium of (a) and (b) for performing a Fourier transform of the machine readable
data of (a) and for processing said machine readable data of (b) into structural coordinates;
and

(e) a display coupled to said central-processing unit for displaying said structural coordinates of
said molecule or molecular complex.

STRUCTURAL STUDIES 

The present invention also provides a method for determining the secondary and/or tertiary
structures of a polypeptide or part thereof by using a crystal, or a model according to the present invention.
The polypeptide or part thereof may be any polypeptide or part thereof for which the secondary and/or
tertiary structure is uncharacterised or incompletely characterised. In a preferred embodiment the
polypeptide shares (or is predicted to share) some structural or functional homology to a crystal of the
present invention. For example, the polypeptide may show a degree of structural homology over some or
all parts of the primary amino acid sequence.

The polypeptide may be an RTK, preferably an Eph receptor with a different specificity for a
nucleotide, or substrate. The polypeptide may be an RTK, preferably an Eph receptor, which requires a
different metal cofactor. Alternatively (or in addition) the polypeptide may be an RTK, preferably an Eph
receptor from a different species.

The polypeptide may be a mutant of a wild-type RTK, in particular an Eph receptor. A mutant 
may arise naturally, or may be made artificially (for example using molecular biology techniques). The 
mutant may also not be "made" at all in the conventional sense, but merely tested theoretically using the 
model of the present invention. A mutant may or may not be functional.

Thus, using a model of the present invention, the effect of a particular mutation on the overall two 
and/or three dimensional structure of an RTK, in particular an Eph receptor, the autoinhibited state or 
active state, and/or the interaction between a binding pocket of the enzyme and a ligand can be 
investigated. 

Alternatively, the polypeptide may perform an analogous function or be suspected to show a
similar catalytic mechanism to an RTK, in particular an Eph receptor.

The polypeptide may also be the same as the polypeptide of the crystal, but in association with a
different ligand (for example, modulator or inhibitor) or cofactor. In this way it is possible to investigate
the effect of altering the ligand or compound with which the polypeptide is associated on the structure of
the binding pocket.

Secondary or tertiary structure may be determined by applying the structural coordinates of the 
crystal or model of the present invention to other data such as an amino acid sequence, X-ray 
crystallographic diffraction data, or nuclear magnetic resonance (NMR) data. Homology modeling, 
molecular replacement, and nuclear magnetic resonance methods using these other data sets are described
below.

Homology modeling (also known as comparative modeling or knowledge-based modeling)
methods develop a three dimensional model from a polypeptide sequence based on the structures of known
proteins (i.e. an RTK, in particular an Eph receptor, of the crystal). The method utilizes a computer model
of a crystal of the present invention (the "known structure"), a computer representation of the amino acid
sequence of the polypeptide with an unknown structure, and standard computer representations of the
structures of amino acids. The method in particular comprises the steps of: (a) identifying structurally
conserved and variable regions in the known structure; (b) aligning the amino acid sequences of the known
structure and unknown structure; (c) generating co-ordinates of main chain atoms and side chain atoms in
structurally conserved and variable regions of the unknown structure based on the coordinates of the
known structure, thereby obtaining a homology model; and (d) refining the homology model to obtain a
three dimensional structure for the unknown structure. This method is well known to those skilled in the
art (Greer, 1985, Science 228, 1055; Blundell et al., 1988, Eur. J. Biochem. 172, 513; Knighton et al., 1992,
Science 258:130-135, http://biochem.vt.edu/courses/modeling/ homology.htn). Computer programs that
can be used in homology modelling are Quanta and the Homology module in the Insight II modelling
package distributed by Molecular Simulations Inc, or MODELLER (Rockefeller University,
ww.iucr.ac.uk/sinris-top/logical/prg-modeller.html).

In step (a) of the homology modelling method, a known structure is examined to identify the 
structurally conserved regions (SCRs) from which an average structure, or framework, can be constructed
for these regions of the protein. Variable regions (VRs), in which known structures may differ in 
conformation, also must be identified. SCRs generally correspond to the elements of secondary structure, 
such as alpha-helices and beta-sheets, and to ligand- and substrate-binding sites (e.g. nucleotide binding 
sites). The VRs usually lie on the surface of the proteins and form the loops where the main chain turns. 

Many methods are available for sequence alignment of known structures and unknown structures.
Sequence alignments generally are based on the dynamic programming algorithm of Needleman and
Wunsch [J. Mol. Biol. 48: 442-453, 1970]. Current methods include FASTA, Smith-Waterman, and
BLASTP, with the BLASTP method differing from the other two in not allowing gaps. Scoring of
alignments typically involves construction of a 20x20 matrix in which identical amino acids and those of
similar character (i.e., conservative substitutions) may be scored higher than those of different character.
Substitution schemes which may be used to score alignments include the scoring matrices PAM (Dayhoff
et al., Meth. Enzymol. 91: 524-545, 1983), and BLOSUM (Henikoff and Henikoff, Proc. Natl. Acad. Sci.
USA 89: 10915-10919, 1992), and the matrices based on alignments derived from three-dimensional
structures including that of Johnson and Overington (JO matrices) (J. Mol. Biol. 233: 716-738, 1993).
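
By way of illustration only, the dynamic programming alignment referred to above may be sketched as follows. This is a minimal sketch of a Needleman-Wunsch style global alignment. The scoring function `toy_score`, the residue groupings in `CONSERVATIVE_GROUPS`, and the linear gap penalty are assumptions made for the example; they stand in for a full 20x20 matrix such as PAM or BLOSUM and are not taken from any of the programs named above.

```python
# Hypothetical groupings of residues of similar character (illustration only).
CONSERVATIVE_GROUPS = ("ILVM", "FYW", "KRH", "DE", "ST", "NQ")


def toy_score(x, y):
    """Score identical residues highest, conservative substitutions next."""
    if x == y:
        return 2
    if any(x in g and y in g for g in CONSERVATIVE_GROUPS):
        return 1
    return -1


def needleman_wunsch(seq_a, seq_b, score_fn=toy_score, gap=-4):
    """Global alignment; returns (score, aligned_a, aligned_b)."""
    n, m = len(seq_a), len(seq_b)
    # dp[i][j] holds the best score for aligning seq_a[:i] with seq_b[:j].
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * gap
    for j in range(1, m + 1):
        dp[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dp[i][j] = max(
                dp[i - 1][j - 1] + score_fn(seq_a[i - 1], seq_b[j - 1]),
                dp[i - 1][j] + gap,   # gap in seq_b
                dp[i][j - 1] + gap,   # gap in seq_a
            )
    # Trace back from the bottom-right cell to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and
                dp[i][j] == dp[i - 1][j - 1] + score_fn(seq_a[i - 1], seq_b[j - 1])):
            out_a.append(seq_a[i - 1])
            out_b.append(seq_b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and dp[i][j] == dp[i - 1][j] + gap:
            out_a.append(seq_a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(seq_b[j - 1])
            j -= 1
    return dp[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

For example, `needleman_wunsch("MKV", "MV")` aligns the shorter sequence against the longer one with a single gap opposite the lysine. In practice a published substitution matrix and a more careful gap model would replace the toy scoring used here.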

Alignment based solely on sequence may be used; however, other structural features also may be
taken into account. In Quanta, multiple sequence alignment algorithms are available that may be used
when aligning a sequence of the unknown with the known structures. Four scoring systems (i.e. sequence
homology, secondary structure homology, residue accessibility homology, CA-CA distance homology) are
available, each of which may be evaluated during an alignment so that relative statistical weights may be
assigned.

When generating coordinates for the unknown structure, main chain atoms and side chain atoms,
both in SCRs and VRs, need to be modelled. A variety of approaches known to those skilled in the art may
be used to assign co-ordinates to the unknown. In particular, the co-ordinates of the main chain atoms of
SCRs will be transferred to the unknown structure. VRs correspond most often to the loops on the surface
of the polypeptide and if a loop in the known structure is a good model for the unknown, then the main
chain co-ordinates of the known structure may be copied. Side chain coordinates of SCRs and VRs are
copied if the residue type in the unknown is identical to or very similar to that in the known structure. For
other side chain coordinates, a side chain rotamer library may be used to define the side chain coordinates.
When a good model for a loop cannot be found, fragment databases may be searched for loops in other
proteins that may provide a suitable model for the unknown. If desired, the loop may then be subjected to
conformational searching to identify low energy conformers.

Once a homology model has been generated it is analyzed to determine its correctness. A
computer program available to assist in this analysis is the Protein Health module in Quanta which
provides a variety of tests. Other programs that provide structure analysis along with output include
PROCHECK and 3D-Profiler [Luthy R et al, Nature 356: 83-85, 1992; and Bowie, J.U. et al, Science 253:
164-170, 1991]. Once any irregularities have been resolved, the entire structure may be further refined.
Refinement may consist of energy minimization with restraints, especially for the SCRs. Restraints may be
gradually removed for subsequent minimizations. Molecular dynamics may also be applied in conjunction
with energy minimization.

Molecular replacement involves applying a known structure to solve the X-ray crystallographic
data set of a polypeptide of unknown structure. The method can be used to define the phases describing the
X-ray diffraction data of a polypeptide of unknown structure when only the amplitudes are known. Thus,
in an embodiment of the invention, a method is provided for determining three dimensional structures of
polypeptides with unknown structure by (a) applying the structural coordinates of a crystal of the present
invention to provide an X-ray crystallographic data set for a polypeptide of unknown structure, and (b)
determining a low energy conformation of the resulting structure.

The structural coordinates of a crystal of the present invention may be applied to nuclear magnetic
resonance (NMR) data to determine the three dimensional structures of polypeptides with uncharacterised
or incompletely characterised structure. (See for example, Wuthrich, 1986, John Wiley and Sons, New
York: 176-199; Pflugrath et al, 1986, J. Molecular Biology 189: 383-386; Kline et al, 1986 J. Molecular
Biology 189:377-382). While the secondary structure of a polypeptide may often be determined by NMR
data, the spatial connections between individual pieces of secondary structure are not as readily
determined. The structural coordinates of a polypeptide defined by X-ray crystallography can guide the
NMR spectroscopist to an understanding of the spatial interactions between secondary structural elements
in a polypeptide of related structure. Information on spatial interactions between secondary structural
elements can greatly simplify Nuclear Overhauser Effect (NOE) data from two-dimensional NMR
experiments. In addition, applying the structural coordinates after the determination of secondary structure
by NMR techniques simplifies the assignment of NOEs relating to particular amino acids in the
polypeptide sequence and does not greatly bias the NMR analysis of polypeptide structure.

In an embodiment, the invention relates to a method of determining three dimensional structures
of polypeptides with unknown structures, by applying the structural coordinates of a crystal of the present
invention to nuclear magnetic resonance (NMR) data of the unknown structure. This method comprises the
steps of: (a) determining the secondary structure of an unknown structure using NMR data; and (b)
simplifying the assignment of through-space interactions of amino acids. The term "through-space
interactions" defines the orientation of the secondary structural elements in the three dimensional structure
and the distances between amino acids from different portions of the amino acid sequence. The term
"assignment" defines a method of analyzing NMR data and identifying which amino acids give rise to
signals in the NMR spectrum.

SCREENING METHODS

Another aspect of the present invention is the design and identification of agents that inhibit or
potentiate an autoinhibition state or active state of an RTK. The rational design and identification of
agents can be accomplished by utilizing the structural coordinates that define a binding pocket of an RTK.
The structures described herein, and the structures of other polypeptides determined by homology
modeling, molecular replacement, and NMR techniques described herein can also be applied to modulator
design and identification methods.

The invention contemplates molecular models, in particular three-dimensional molecular models
of RTK proteins, and their use as templates for the design of agents able to mimic or inhibit ligand
activation or autophosphorylation or phosphorylation of the proteins (e.g. modulators). A modulator may
inhibit or potentiate an autoinhibited state or alternatively an active state.

In certain embodiments, the present invention provides a method of screening for a ligand that
associates with a binding pocket and/or modulates the function of an Eph receptor by using a crystal or a
model according to the present invention. The method may involve investigating whether a test compound
is capable of associating with or binding a binding pocket, and/or inhibiting or enhancing interactions of
atomic contacts in a binding pocket.

In accordance with an aspect of the present invention, a method is provided for screening for a 
ligand capable of binding to a binding pocket, wherein the method comprises using a crystal or model 
according to the invention. 

In another aspect, the invention relates to a method of screening for a ligand capable of binding to
a binding pocket, wherein the binding pocket is defined by the structural coordinates given herein, the
method comprising contacting the binding pocket with a test compound and determining if the test
compound binds to the binding pocket. The binding pocket may be a binding pocket of an autoinhibited
state or an active state. In the case of an autoinhibited state binding pocket the screening method may
potentially identify an inhibitor that may disrupt catalytic activity of an RTK, for example, by maintaining
the RTK in an autoinhibited state. A disruption of catalytic activity may be useful in the treatment of
conditions involving increased RTK activity, e.g. cancer.

In one embodiment, the present invention provides a method of screening for a test compound
capable of interacting with one or more key amino acid residues of a binding pocket of an RTK. For
example, a test compound that interacts with one or more of Tyr/Phe 604, Tyr/Phe 610, Tyr 667, Tyr 744,
and Tyr 750 of EphB2 receptor may prevent phosphorylation of one or more of the tyrosines and thereby
promote the autoinhibited state of the receptor.

Another aspect of the invention provides a process comprising the steps of:
(a) performing a method of screening for a ligand described above;
(b) identifying one or more ligands capable of binding to a binding pocket; and
(c) preparing a quantity of said one or more ligands.

A further aspect of the invention provides a process comprising the steps of:
(a) performing a method of screening for a ligand as described above;
(b) identifying one or more ligands capable of binding to a binding pocket; and
(c) preparing a pharmaceutical composition comprising said one or more ligands.

Once a test compound capable of interacting with one or more key amino acid residues in a 
binding pocket of an RTK has been identified, further steps may be carried out either to select and/or 
modify compounds and/or to modify existing compounds, to modulate the interaction with the key amino
acid residues in the binding pocket. 

Yet another aspect of the invention provides a process comprising the steps of:
(a) performing the method of screening for a ligand as described above;
(b) identifying one or more ligands capable of binding to a binding pocket;
(c) modifying said one or more ligands capable of binding to a binding pocket;
(d) performing said method of screening for a ligand as described above; and
(e) optionally preparing a pharmaceutical composition comprising said one or more ligands.

In another aspect of the invention, a method of screening for a test compound is provided
comprising screening for test compounds that affect (inhibit or potentiate) a juxtamembrane-juxtamembrane
interaction (e.g. interactions 25 to 49 in Table 2) or juxtamembrane-kinase interactions
(e.g. interactions 1 to 24 in Table 2) described herein.

As used herein, the term "test compound" means any compound which is potentially capable of
associating with a binding pocket, inhibiting or enhancing interactions of atomic contacts in a binding
pocket, and/or inhibiting or potentiating an autoinhibited state or active state of an RTK. If, after testing, it
is determined that the test compound does bind to the binding pocket, inhibits or enhances interactions of
atomic contacts in a binding pocket, and/or inhibits or potentiates an autoinhibited or active state of an
RTK, it is known as a "ligand".

The test compound may be designed or obtained from a library of compounds which may
comprise peptides, as well as other compounds, such as small organic molecules and particularly new lead
compounds. By way of example, the test compound may be a natural substance, a biological
macromolecule, or an extract made from biological materials such as bacteria, fungi, or animal
(particularly mammalian) cells or tissues, an organic or an inorganic molecule, a synthetic test compound,
a semi-synthetic test compound, a carbohydrate, a monosaccharide, an oligosaccharide or polysaccharide, a
glycolipid, a glycopeptide, a saponin, a heterocyclic compound, a structural or functional mimetic, a
peptide, a peptidomimetic, a derivatised test compound, a peptide cleaved from a whole protein, or a
peptide synthesised synthetically (such as, by way of example, either using a peptide synthesizer or by
recombinant techniques or combinations thereof), a recombinant test compound, a natural or a non-natural
test compound, a fusion protein or equivalent thereof and mutants, derivatives or combinations thereof.

The increasing availability of biomacromolecule structures of potential pharmacophoric
molecules that have been solved crystallographically has prompted the development of a variety of direct
computational methods for molecular design, in which the steric and electronic properties of substrate
binding sites are used to guide the design of potential ligands (Cohen et al. (1990) J. Med. Chem. 33: 883-894;
Kuntz et al. (1982) J. Mol. Biol. 161: 269-288; DesJarlais et al. (1988) J. Med. Chem. 31: 722-729; Bartlett
et al. (1989) (Spec. Publ., Roy. Soc. Chem.) 78: 182-196; Goodford et al. (1985) J. Med. Chem. 28: 849-857;
DesJarlais et al. J. Med. Chem. 29: 2149-2153). Directed methods generally fall into two categories:
(1) design by analogy in which 3-D structures of known molecules (such as from a crystallographic
database) are docked to the receptor structure and scored for goodness-of-fit; and (2) de novo design, in
which the ligand model is constructed piece-wise in the receptor. The latter approach, in particular, can
facilitate the development of novel molecules, uniquely designed to bind to the subject receptor.

The test compound may be screened as part of a library or a data base of molecules. Modulators
of inactivated/activated states of an RTK or binding pocket thereof may be identified by docking a
computer representation of compounds from one or more data bases of molecules. Data bases which may be
used include ACD (Molecular Designs Limited), NCI (National Cancer Institute), CCDC (Cambridge
Crystallographic Data Center), CAS (Chemical Abstracts Service), Derwent (Derwent Information
Limited), Maybridge (Maybridge Chemical Company Ltd), Aldrich (Aldrich Chemical Company), DOCK
(University of California in San Francisco), and the Directory of Natural Products (Chapman & Hall).
Computer programs such as CONCORD (Tripos Associates) or DB-Converter (Molecular Simulations
Limited) can be used to convert a data set represented in two dimensions to one represented in three
dimensions.

Test compounds may be tested for their capacity to fit spatially into a binding pocket. As used
herein, the term "fits spatially" means that the three-dimensional structure of the test compound is
accommodated geometrically in a cavity of a binding pocket. The test compound can then be considered
to be a ligand.

A favourable geometric fit occurs when the surface area of the test compound is in close
proximity with the surface area of the cavity of a binding pocket without forming unfavourable interactions.
A favourable complementary interaction occurs where the test compound interacts by hydrophobic,
aromatic, ionic, dipolar, or hydrogen donating and accepting forces. Unfavourable interactions may be
steric hindrance between atoms in the test compound and atoms in the binding pocket.
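
As a rough illustration of the geometric fit described above, the following sketch counts close contacts and steric clashes between a test compound and a binding pocket. It is a minimal sketch under stated assumptions: atoms are given as `(x, y, z, vdw_radius)` tuples, and the thresholds `clash_tol` and `contact_margin` are hypothetical values chosen for the example rather than parameters of any program named herein.

```python
import math


def fit_summary(ligand_atoms, pocket_atoms, clash_tol=0.8, contact_margin=1.0):
    """Return (n_contacts, n_clashes) for a posed ligand in a pocket.

    A pair of atoms is counted as a clash when the distance between them is
    less than clash_tol times the sum of their van der Waals radii, and as a
    favourable close contact when it lies between that limit and the sum of
    the radii plus contact_margin (in the same length units, e.g. Angstroms).
    """
    contacts = 0
    clashes = 0
    for lx, ly, lz, l_radius in ligand_atoms:
        for px, py, pz, p_radius in pocket_atoms:
            distance = math.dist((lx, ly, lz), (px, py, pz))
            radii_sum = l_radius + p_radius
            if distance < clash_tol * radii_sum:
                clashes += 1      # steric hindrance
            elif distance <= radii_sum + contact_margin:
                contacts += 1     # surfaces in close proximity
    return contacts, clashes
```

A pose with many contacts and no clashes would be regarded as a better geometric fit than one with few contacts or with any clashes. Chemical complementarity (hydrophobic, ionic, or hydrogen bonding character) is deliberately not evaluated in this sketch.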

If a model of the present invention is a computer model, the test compounds may be positioned in 
a binding pocket through computational docking. If, on the other hand, the model of the present invention 
is a structural model, the test compounds may be positioned in the binding pocket by, for example, manual 
docking. 

As used herein the term "docking" refers to a process of placing a compound in close proximity
with a binding pocket, or a process of finding low energy conformations of a test compound/binding
pocket complex.

In an illustrative embodiment, the design of potential RTK, in particular EphB2, ligands begins
from the general perspective of shape complementarity for an active site and substrate specificity subsites
of the receptor, and a search algorithm is employed which is capable of scanning a database of small
molecules of known three-dimensional structure for candidates which fit geometrically into the target
protein site. It is not expected that the molecules found in the shape search will necessarily be leads
themselves, since no evaluation of chemical interaction need necessarily be made during the initial search.
Rather, it is anticipated that such candidates might act as the framework for further design, providing
molecular skeletons to which appropriate atomic replacements can be made. Of course, the chemical
complementarity of these molecules can be evaluated, but it is expected that atom types will be changed to
maximize the electrostatic, hydrogen bonding, and hydrophobic interactions with the receptor. Most
algorithms of this type provide a method for finding a wide assortment of chemical structures that are
complementary to the shape of a binding site of the subject receptor. Each of a set of small molecules
from a particular data-base, such as the Cambridge Crystallographic Data Bank (CCDB) (Allen et al.
(1973) J. Chem. Doc. 13:119), is individually docked to the binding pocket or site of an RTK, in particular
an EphB2 receptor, in a number of geometrically permissible orientations with use of a docking algorithm.

In a preferred embodiment, a set of computer algorithms called DOCK can be used to characterize the
shape of invaginations and grooves that form active sites and recognition surfaces of a subject receptor
(Kuntz et al. (1982) J Mol Biol 161: 269-288). The program can also search a database of small
molecules for templates whose shapes are complementary to particular binding pockets or sites of a
receptor (DesJarlais et al. (1988) J Med Chem 31: 722-729). These templates normally require
modification to achieve good chemical and electrostatic interactions (DesJarlais et al. (1989) ACS Symp
Ser 413: 60-69). However, the program has been shown to position accurately known cofactors for ligands
based on shape constraints alone.

The orientations are evaluated for goodness-of-fit and the best are kept for further examination
using molecular mechanics programs, such as AMBER or CHARMM. Such algorithms have previously
proven successful in finding a variety of molecules that are complementary in shape to a given binding site
of a receptor, and have been shown to have several attractive features. First, such algorithms can retrieve a
remarkable diversity of molecular architectures. Second, the best structures have, in previous applications
to other proteins, demonstrated impressive shape complementarity over an extended surface area. Third,
the overall approach appears to be quite robust with respect to small uncertainties in positioning of the
candidate atoms.

Goodford (1985, J Med Chem 28:849-857) and Boobbyer et al. (1989, J Med Chem 32:1083-1094)
have produced a computer program (GRID) which seeks to determine regions of high affinity for
different chemical groups (termed probes) on the molecular surface of the binding site. GRID hence
provides a tool for suggesting modifications to known ligands that might enhance binding. It may be
anticipated that some of the sites discerned by GRID as regions of high affinity correspond to
"pharmacophoric patterns" determined inferentially from a series of known ligands. As used herein, a
pharmacophore pattern is a geometric arrangement of features of the anticipated ligand that is believed to
be important for binding. Attempts have been made to use pharmacophore patterns as a search screen for
novel ligands (Jakes et al. (1987) J Mol Graph 5:41-48; Brint et al. (1987) J Mol Graph 5:49-56; Jakes et
al. (1986) J Mol Graph 4:12-20); however, the constraint of steric and "chemical" fit in the putative (and
possibly unknown) receptor binding pocket or site is ignored. Goodsell and Olson (1990, Proteins: Struct
Funct Genet 8:195-202) have used the Metropolis (simulated annealing) algorithm to dock a single known
ligand into a target protein. They allow torsional flexibility in the ligand and use GRID interaction energy
maps as rapid lookup tables for computing approximate interaction energies. Given the large number of
degrees of freedom available to the ligand, the Metropolis algorithm is time-consuming and is unsuited to
searching a candidate database of a few thousand small molecules.
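
The Metropolis (simulated annealing) approach mentioned above may be illustrated, in a much simplified form, as follows. This is a minimal sketch and not the published implementation: it moves a single rigid probe position only (no torsional flexibility), the energy is supplied by the caller rather than read from GRID maps, and the function name, the geometric cooling schedule, and all default parameter values are assumptions made for the example.

```python
import math
import random


def metropolis_anneal(energy, start, steps=2000, t_start=2.0, t_end=0.01,
                      step_size=0.5, seed=0):
    """Anneal a rigid probe position; returns (best_position, best_energy)."""
    rng = random.Random(seed)
    current = list(start)
    e_current = energy(current)
    best, e_best = list(current), e_current
    for k in range(steps):
        # Geometric cooling from t_start down to t_end.
        temperature = t_start * (t_end / t_start) ** (k / (steps - 1))
        trial = [c + rng.uniform(-step_size, step_size) for c in current]
        e_trial = energy(trial)
        # Metropolis criterion: always accept downhill moves, and accept
        # uphill moves with probability exp(-dE / T).
        if (e_trial <= e_current or
                rng.random() < math.exp(-(e_trial - e_current) / temperature)):
            current, e_current = trial, e_trial
            if e_current < e_best:
                best, e_best = list(current), e_current
    return best, e_best


def toy_energy(position, centre=(1.0, 2.0, 3.0)):
    """Hypothetical energy: squared distance from an assumed pocket centre."""
    return sum((p - c) ** 2 for p, c in zip(position, centre))
```

Calling `metropolis_anneal(toy_energy, (5.0, 5.0, 5.0))` returns the lowest energy position visited. Each additional degree of freedom enlarges the space that must be sampled, which is why the text above regards the approach as too slow for screening a database of candidates.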

Yet a further embodiment of the present invention utilizes a computer algorithm such as CLIX
which searches such databases as CCDB for small molecules which can be oriented in a receptor binding
pocket or site in a way that is both sterically acceptable and has a high likelihood of achieving favorable
chemical interactions between the candidate molecule and the surrounding amino acid residues. The
method is based on characterizing a binding pocket in terms of an ensemble of favorable binding positions
for different chemical groups and then searching for orientations of the candidate molecules that cause
maximum spatial coincidence of individual candidate chemical groups with members of the ensemble.
The current availability of computer power dictates that a computer-based search for novel ligands follows
a breadth-first strategy. A breadth-first strategy aims to reduce progressively the size of the potential
candidate search space by the application of increasingly stringent criteria, as opposed to a depth-first
strategy wherein a maximally detailed analysis of one candidate is performed before proceeding to the
next. CLIX conforms to this strategy in that its analysis of binding is rudimentary: it seeks to satisfy the
necessary conditions of steric fit and of having individual groups in "correct" places for bonding, without
imposing the sufficient condition that favorable bonding interactions actually occur. A ranked "shortlist"
of molecules, in their favored orientations, is produced which can then be examined on a molecule-by-molecule
basis, using computer graphics and more sophisticated molecular modeling techniques. CLIX is
also capable of suggesting changes to the substituent chemical groups of the candidate molecules that
might enhance binding.

The algorithmic details of CLIX are described in Lawrence et al. (1992) Proteins 12:31-41, and
the CLIX algorithm can be summarized as follows. The GRID program is used to determine discrete
favorable interaction positions (termed target sites) in the binding pocket or site of the protein for a wide
variety of representative chemical groups. For each candidate ligand in the CCDB an exhaustive attempt is
made to make coincident, in a spatial sense in the binding site of the protein, a pair of the candidate's
substituent chemical groups with a pair of corresponding favorable interaction sites proposed by GRID.
All possible combinations of pairs of ligand groups with pairs of GRID sites are considered during this
procedure. Upon locating such coincidence, the program rotates the candidate ligand about the two pairs
of groups and checks for steric hindrance and coincidence of other candidate atomic groups with
appropriate target sites. Particular candidate/orientation combinations that are good geometric fits in the
binding site and show sufficient coincidence of atomic groups with GRID sites are retained.
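
The pair-matching stage summarized above may be sketched as follows. This is an illustrative sketch only and is not the CLIX code: it covers just the enumeration of ligand group pairs against target site pairs, using the assumption that a pair of groups can be made coincident with a pair of sites when the group types match and the two separations agree within a tolerance. The subsequent rotation about the matched pair and the steric check are omitted, and the function name, data layout, and `tol` value are hypothetical.

```python
import math
from itertools import combinations, permutations


def matching_pairs(ligand_groups, target_sites, tol=0.5):
    """List ((i, j), (a, b)) index pairs that could be superimposed.

    ligand_groups and target_sites are sequences of (group_type, (x, y, z)).
    Ligand groups i and j are matched to target sites a and b respectively.
    """
    matches = []
    for (i, (type_i, pos_i)), (j, (type_j, pos_j)) in combinations(
            enumerate(ligand_groups), 2):
        ligand_separation = math.dist(pos_i, pos_j)
        # Ordered site pairs, so that each ligand group is tried at each site.
        for (a, (type_a, pos_a)), (b, (type_b, pos_b)) in permutations(
                enumerate(target_sites), 2):
            same_types = type_i == type_a and type_j == type_b
            site_separation = math.dist(pos_a, pos_b)
            if same_types and abs(ligand_separation - site_separation) <= tol:
                matches.append(((i, j), (a, b)))
    return matches
```

Each returned match would then serve as a starting point for orienting the candidate in the binding site and testing the remaining groups, in keeping with the breadth-first strategy of applying inexpensive filters first.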

Consistent with the breadth-first strategy, this approach involves simplifying assumptions. Rigid
protein and small molecule geometry is maintained throughout. As a first approximation rigid geometry is
acceptable as the energy minimized coordinates of an RTK, in particular an EphB2 deduced structure, as
described herein, describe an energy minimum for the molecule, albeit a local one. If the surface residues
of the site of interest are not involved in crystal contacts then the crystal configuration of those residues is
used merely as a starting point for energy minimization, and potential solution structures for those residues
determined. The deduced structure described herein should reasonably mimic the mean solution
configuration.

A further assumption implicit in CLIX is that the potential ligand, when introduced into the
binding pocket or site of a receptor, does not induce change in the protein's stereochemistry or partial
charge distribution and so alter the basis on which the GRID interaction energy maps were computed. It
must also be stressed that the interaction sites predicted by GRID are used in a positional and type sense
only, i.e., when a candidate atomic group is placed at a site predicted as favorable by GRID, no check is
made to ensure that the bond geometry, the state of protonation, or the partial charge distribution favors a
strong interaction between the protein and that group. Such detailed analysis should form part of more
advanced modeling of candidates identified in the CLIX shortlist.

Yet another embodiment of a computer-assisted molecular design method for identifying ligands
of a binding pocket of an RTK comprises the de novo synthesis of potential ligands by algorithmic
connection of small molecular fragments that will exhibit the desired structural and electrostatic
complementarity with an active site or binding pocket of the receptor. The methodology employs a large
template set of small molecules which are iteratively pieced together in a model of an RTK active site or
binding pocket. Each stage of ligand growth is evaluated according to a molecular mechanics-based
energy function, which considers van der Waals and coulombic interactions, internal strain energy of the
lengthening ligand, and desolvation of both ligand and receptor. The search space can be managed by use
of a data tree which is kept under control by pruning according to the binding criteria.

In an illustrative embodiment, the search space is limited to consider only amino acids and amino
acid analogs as the molecular building blocks. Such a methodology generally employs a large template
set of amino acid conformations, though need not be restricted to just the 20 natural amino acids, as it can
easily be extended to include other related fragments of interest to the medicinal chemist, e.g. amino acid
analogs. The putative ligands that result from this construction method are peptides and peptide-like
compounds rather than the small organic molecules that are typically the goal of drug design research. The
appeal of the peptide building approach is not that peptides are preferable to organics as potential
pharmaceutical agents, but rather that: (1) they can be generated relatively rapidly de novo; (2) their
energetics can be studied by well-parameterized force field methods; (3) they are much easier to synthesize
than are most organics; and (4) they can be used in a variety of ways, for peptidomimetic ligand design,
protein-protein binding studies, and even as shape templates in the more commonly used 3D organic
database search approach described above.

Such a de novo peptide design method has been incorporated in a software package called GROW
(Moon et al. (1991) Proteins 11:314-328). In a typical design session, standard interactive graphical
modeling methods are employed to define the structural environment in which GROW is to operate. For
instance, the environment could be an active site binding pocket of an RTK, in particular an EphB2, or it could
be a set of features on the protein's surface to which the user wishes to bind a peptide-like molecule. The
GROW program then operates to generate a set of potential ligand molecules. Interactive modeling
methods then come into play again, for examination of the resulting molecules, and for selection of one or
more of them for further refinement.

To illustrate, GROW operates on an atomic coordinate file generated by the user in the interactive
modeling session, such as the coordinates provided in Table 3, or the coordinates of a binding pocket or
active site as described in Tables 2 and 3 plus a small fragment (e.g., an acetyl group) positioned in the
active site to provide a starting point for peptide growth. These are referred to as "site" atoms and "seed"
atoms, respectively. A second file provided by the user contains a number of control parameters to guide
the peptide growth (Moon et al. (1991) Proteins 11:314-328).

The operation of the GROW algorithm is conceptually fairly simple. GROW proceeds in an
iterative fashion, to systematically attach to the seed fragment each amino acid template in a large
preconstructed library of amino acid conformations. When a template has been attached, it is scored for
goodness-of-fit to the receptor site or binding pocket, and then the next template in the library is attached
to the seed. After all the templates have been tested, only the highest scoring ones are retained for the next
level of growth. This procedure is repeated for the second growth level; each library template is attached
in turn to each of the bonded seed/amino acid molecules that were retained from the first step, and is then
scored. Again, only the best of the bonded seed/dipeptide molecules that result are retained for the third
level of growth. The growth of peptides can proceed in the N-to-C direction only, the reverse direction
only, or in alternating directions, depending on the initial control specifications supplied by the user.
Successive growth levels therefore generate peptides that are lengthened by one residue. The procedure
terminates when the user-defined peptide length has been reached, at which point the user can select from
the constructed peptides those to be studied further. The resulting data provided by the GROW procedure
includes not only residue sequences and scores, but also atomic coordinates of the peptides, related directly
to the coordinate system of the receptor site atoms.
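
The level-by-level growth described above resembles what is often called a beam search, and may be sketched as follows. This is a minimal sketch, not the GROW program: templates are represented simply as residue labels, growth proceeds in one direction only, and the scoring function `toy_score` with its per-position preference table `POSITION_PREFERENCES` is a hypothetical stand-in for the molecular mechanics-based goodness-of-fit evaluation against the site atoms.

```python
# Hypothetical per-position preferences of an imagined binding pocket.
POSITION_PREFERENCES = [
    {"A": 1, "K": 3, "F": 0},
    {"A": 2, "K": 0, "F": 1},
]


def toy_score(chain):
    """Sum the preference of each grown residue (chain[0] is the seed)."""
    return sum(
        POSITION_PREFERENCES[position].get(residue, 0)
        for position, residue in enumerate(chain[1:])
        if position < len(POSITION_PREFERENCES)
    )


def grow(seed, templates, score=toy_score, keep=2, length=2):
    """Attach every template to every retained chain, keeping the top `keep`.

    Returns a list of (chain, score) for the chains retained at the final
    growth level, best first.
    """
    retained = [tuple(seed)]
    for _ in range(length):
        candidates = [chain + (t,) for chain in retained for t in templates]
        candidates.sort(key=score, reverse=True)
        retained = candidates[:keep]   # prune to the highest scoring chains
    return [(chain, score(chain)) for chain in retained]
```

For instance, `grow(("Ac",), "AKF")` grows two residues onto an acetyl seed and returns the two best chains. Pruning at every level keeps the number of evaluated chains proportional to `keep` times the library size per level, rather than growing exponentially with peptide length.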

In yet another embodiment, potential pharmacophoric compounds can be determined using a
method based on an energy minimization-quenched molecular dynamics algorithm for determining
energetically favorable positions of functional groups in the binding pockets of the subject receptor. The
method can aid in the design of molecules that incorporate such functional groups by modification of
known ligands or de novo construction.

For example, the multiple copy simultaneous search method (MCSS) described by Miranker et al.
(1991) Proteins 11: 29-34 may be employed. To determine and characterize the local minima of a functional
group in the forcefield of the protein, multiple copies of selected functional groups are first distributed in a
binding pocket of interest on the RTK protein. Energy minimization of these copies by molecular
mechanics or quenched dynamics yields the distinct local minima. The neighborhood of these minima can
then be explored by a grid search or by constrained minimization. In one embodiment, the MCSS method
uses the classical time dependent Hartree (TDH) approximation to simultaneously minimize or quench
many identical groups in the forcefield of the protein.

Implementation of the MCSS algorithm requires a choice of functional groups and a molecular 
mechanics model for each of them. Groups must be simple enough to be easily characterized and
manipulated (3-6 atoms, few or no dihedral degrees of freedom), yet complex enough to approximate the
steric and electrostatic interactions that the functional group would have in binding to the pocket or site of 
interest in the RTK protein. A preferred set is, for example, one in which most organic molecules can be 
described as a collection of such groups (Patai's Guide to the Chemistry of Functional Groups, ed. S. Patai
(New York: John Wiley and Sons, 1989)). This includes fragments such as acetonitrile, methanol,
acetate, methyl ammonium, dimethyl ether, methane, and acetaldehyde. 

Determination of the local energy minima in the binding pocket or site requires that many starting 
positions be sampled. This can be achieved by distributing, for example, 1,000-5,000 groups at random 
inside a sphere centered on the binding site; only the space not occupied by the protein needs to be
considered. If the interaction energy of a particular group at a certain location with the protein is more
positive than a given cut-off (e.g. 5.0 kcal/mole) the group is discarded from that site. Given the set of
starting positions, all the fragments are minimized simultaneously by use of the TDH approximation (Elber
et al. (1990) J Am Chem Soc 112: 9161-9175). In this method, the forces on each fragment consist of its
internal forces and those due to the protein. The essential element of this method is that the interactions
between the fragments are omitted and the forces on the protein are normalized to those due to a single
fragment. In this way simultaneous minimization or dynamics of any number of functional groups in the
field of a single protein can be performed.

Minimization is performed successively on subsets of, for example, 100 of the randomly placed
groups. After a certain number of step intervals, such as 1,000 intervals, the results can be examined to
eliminate groups converging to the same minimum. This process is repeated until minimization is
complete (e.g. RMS gradient of 0.01 kcal/mole/Å). Thus the resulting energy minimized set of molecules
comprises what amounts to a set of disconnected fragments in three dimensions representing potential 
pharmacophores. 
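
The sampling and minimization steps described above can be sketched as follows. This is a minimal illustration only, not the MCSS program itself: each copy of a functional group is reduced to a single point, the energy function `toy_energy` and its well positions are hypothetical stand-ins for a molecular mechanics forcefield, and plain steepest descent replaces quenched dynamics. Because interactions between fragments are omitted, each copy is minimized independently here.

```python
import math
import random


def random_point_in_sphere(rng, center, radius):
    """Rejection-sample a point uniformly inside a sphere."""
    while True:
        v = [rng.uniform(-radius, radius) for _ in range(3)]
        if sum(x * x for x in v) <= radius * radius:
            return [c + x for c, x in zip(center, v)]


def numeric_gradient(energy, p, h=1e-5):
    """Central-difference gradient of the energy at point p."""
    grad = []
    for i in range(3):
        hi, lo = list(p), list(p)
        hi[i] += h
        lo[i] -= h
        grad.append((energy(hi) - energy(lo)) / (2 * h))
    return grad


def mcss_sketch(energy, center, radius, n_copies=1000, cutoff=5.0,
                step=0.1, rms_tol=0.01, merge_dist=0.2, max_steps=5000, seed=0):
    """Scatter copies, discard high-energy starts, minimize, merge duplicates."""
    rng = random.Random(seed)
    starts = [random_point_in_sphere(rng, center, radius) for _ in range(n_copies)]
    # Discard copies whose starting energy is more positive than the cut-off.
    copies = [p for p in starts if energy(p) <= cutoff]
    minima = []
    for p in copies:
        for _ in range(max_steps):
            g = numeric_gradient(energy, p)
            if math.sqrt(sum(x * x for x in g) / 3) < rms_tol:
                break  # RMS gradient criterion reached
            p = [x - step * gx for x, gx in zip(p, g)]
        # Keep only one representative of copies converging to the same minimum.
        if all(math.dist(p, m) > merge_dist for m in minima):
            minima.append(p)
    return minima


def toy_energy(p, wells):
    """Hypothetical energy surface: the lowest of several harmonic wells."""
    return min(sum((a - b) ** 2 for a, b in zip(p, c)) - depth for c, depth in wells)
```

Called with two assumed wells, for example `mcss_sketch(lambda p: toy_energy(p, wells), (0.0, 0.0, 0.0), 3.0)`, the function returns one position per distinct local minimum, which plays the role of the disconnected pharmacophoric fragments in the text.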

The next step then is to connect the pharmacophoric pieces with spacers assembled from small
chemical entities (atoms, chains, or ring moieties). In a preferred embodiment, each of the disconnected
fragments can be linked in space to generate a single molecule using such computer programs as, for example,
NEWLEAD (Tschinke et al. (1993) J Med Chem 36: 3863-3870). The procedure adopted by NEWLEAD
executes the following sequence of commands: (1) connect two isolated moieties, (2) retain the
intermediate solutions for further processing, (3) repeat the above steps for each of the intermediate
solutions until no disconnected units are found, and (4) output the final solutions, each of which is a single
molecule. Such a program can use, for example, three types of spacers: library spacers, single-atom
spacers, and fuse-ring spacers. The library spacers are optimized structures of small molecules such as
ethylene, benzene and methylamide. The output produced by programs such as NEWLEAD consists of a
set of molecules containing the original fragments now connected by spacers. The atoms belonging to the
input fragments maintain their original orientations in space. The molecules are chemically plausible
because of the simple makeup of the spacers and functional groups, and energetically acceptable because
of the rejection of solutions with van der Waals radii violations.
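
The four-step linking sequence can be illustrated with a small search over partial solutions. This is a sketch under stated assumptions and does not reproduce NEWLEAD: each fragment is reduced to a single attachment point, the spacer names and spans in `SPACERS` are hypothetical values chosen for illustration, fuse-ring spacers and van der Waals checks are not modelled, and a spacer is accepted when its span matches the distance between two fragments within a tolerance.

```python
import math
from itertools import combinations

# Hypothetical spacer library: name -> distance spanned, in angstroms.
SPACERS = {"single-atom": 2.5, "ethylene": 3.9, "benzene": 5.8}


def fitting_spacers(a, b, tol=0.3):
    """Names of spacers whose span matches the distance between points a and b."""
    d = math.dist(a, b)
    return [name for name, span in SPACERS.items() if abs(span - d) <= tol]


def link_fragments(fragments, tol=0.3):
    """Enumerate ways to join all fragments into one molecule.

    fragments maps a fragment name to its (x, y, z) attachment point, which is
    never moved. Each solution is a frozenset of (frag_a, frag_b, spacer) links.
    """
    start = (frozenset(frozenset([n]) for n in fragments), frozenset())
    frontier = {start}
    solutions = set()
    while frontier:
        next_frontier = set()
        for components, links in frontier:
            if len(components) == 1:
                solutions.add(links)  # (4) no disconnected units remain
                continue
            # (1) connect two isolated moieties with a fitting spacer
            for ca, cb in combinations(components, 2):
                for a in ca:
                    for b in cb:
                        for sp in fitting_spacers(fragments[a], fragments[b], tol):
                            merged = (components - {ca, cb}) | {ca | cb}
                            link = (min(a, b), max(a, b), sp)
                            # (2) retain the intermediate solution
                            next_frontier.add((merged, links | {link}))
        frontier = next_frontier  # (3) repeat for each intermediate solution
    return solutions
```

Storing each state as a pair of frozensets lets identical intermediate solutions reached by different connection orders collapse into one entry.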

A screening method of the present invention may comprise the following steps:

(i) generating a computer model of a binding pocket using a crystal according to the
invention; 

(ii) docking a computer representation of a test compound with the computer model; 

(iii) analysing the fit of the compound in the binding pocket. 

In an aspect of the invention, a method is provided comprising the following steps:

(a) docking a computer representation of a structure of a test compound into a computer
representation of a binding pocket of an RTK defined in accordance with the invention using
a computer program, or by interactively moving the representation of the test compound into
the representation of the binding pocket;

(b) characterizing the geometry and the complementary interactions formed between the atoms
of the binding pocket and the compound; optionally

(c) searching libraries for molecular fragments which can fit into the empty space between the
compound and the binding pocket and can be linked to the compound; and

(d) linking the fragments found in (c) to the compound and evaluating the new modified
compound.

In an embodiment of the invention, a method is provided which comprises the following steps:

(a) docking a computer representation of a test compound from a computer database with a
computer representation of a selected binding pocket on an RTK defined in accordance with
the invention to define a complex;

(b) determining a conformation of the complex with a favorable fit and favourable
complementary interactions; and

(c) identifying test compounds that best fit the selected binding pocket as potential modulators of
the RTK.

In another embodiment of the invention, a method is provided which comprises docking a computer
representation of a test compound with a computer representation of a selected binding pocket of an RTK
defined by the atomic interactions, atomic contacts, or structural coordinates in accordance with the
invention to define a complex. In particular a method is provided comprising:

(a) docking a computer representation of a test compound from a computer database with a computer
representation of a selected binding pocket of an RTK defined by the atomic interactions, atomic
contacts, or structural coordinates described herein;

(b) determining a conformation of the complex with a favorable fit and favourable complementary 
interactions; and 

(c) identifying test compounds that best fit the selected binding pocket as potential modulators of the 
RTK. 
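
The "identify the best fit" step can be illustrated with a toy scoring function. This is a hedged sketch, not a docking program: it assumes each compound has already been placed in the pocket, represents atoms as coordinate and feature-type pairs, and uses distance windows and penalty values that are illustrative assumptions rather than values from this disclosure.

```python
import math

# Feature pairs treated as complementary in this sketch (an assumption).
COMPLEMENT = {("donor", "acceptor"), ("acceptor", "donor"),
              ("hydrophobic", "hydrophobic")}


def fit_score(pocket, compound, contact=(2.5, 4.0), clash=2.0):
    """Score a pre-placed compound: reward complementary contacts, penalize clashes.

    pocket and compound are lists of ((x, y, z), feature_type) tuples.
    """
    score = 0.0
    for p_xyz, p_kind in pocket:
        for c_xyz, c_kind in compound:
            d = math.dist(p_xyz, c_xyz)
            if d < clash:
                score -= 10.0  # steric overlap
            elif contact[0] <= d <= contact[1] and (p_kind, c_kind) in COMPLEMENT:
                score += 1.0  # favourable complementary interaction
    return score


def rank_compounds(pocket, library):
    """Return compound names ordered from best to worst fit."""
    return sorted(library, key=lambda name: fit_score(pocket, library[name]),
                  reverse=True)
```

A real screening run would also search over poses and conformations for each compound before scoring; that search is deliberately left out here.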

A model used in a screening method may comprise a binding pocket either alone or in association
with one or more ligands and/or cofactors. For example, the model may comprise the binding pocket in
association with a nucleotide (or analogue thereof), a substrate (or analogue thereof), and/or modulator.

If the model comprises an unassociated binding pocket, then the selected site under investigation
may be the binding pocket itself. The test compound may, for example, mimic a known ligand (e.g.
nucleotide or substrate) for an RTK in order to interact with the binding pocket. The selected site may
alternatively be another site on the RTK.

If the model comprises an associated binding pocket, for example a binding pocket in association
with a ligand, the selected site may be the binding pocket or a site made up of the binding pocket and the
complexed ligand, or a site on the ligand itself. The test compound may be investigated for its capacity to
modulate the interaction with the associated molecule.

The screening methods described herein may be applied to a plurality of test compounds, to 
identify those that best fit the selected site. The screening methods may be used to identify a modulator
that changes an autoinhibited state of an RTK to an active state, or an active state to an autoinhibited state. 

A test compound (or plurality of test compounds) may be selected on the basis of their similarity 
to a known ligand for an RTK, in particular an Eph receptor. For example, the screening method may 
comprise the following steps:

(i) generating a computer model of a binding pocket in complex with a ligand;

(ii) searching for a test compound with a similar three dimensional structure and/or similar
chemical groups as the ligand; and

(iii) evaluating the fit of the test compound in the binding pocket.

Searching may be carried out using a database of computer representations of potential
compounds, using methods known in the art.
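
One common way to search a database for compounds with similar chemical groups is a set-overlap measure such as the Tanimoto coefficient. The sketch below is illustrative only: the text does not name a similarity measure, and the feature names and the 0.5 threshold are assumptions introduced for the example.

```python
def tanimoto(a, b):
    """Tanimoto (Jaccard) similarity between two collections of chemical features."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similar_compounds(ligand_features, database, threshold=0.5):
    """Return (similarity, name) pairs at or above the threshold, best first."""
    scored = [(tanimoto(ligand_features, feats), name)
              for name, feats in database.items()]
    return sorted((pair for pair in scored if pair[0] >= threshold), reverse=True)
```

The compounds returned by such a search would then go on to step (iii), evaluation of fit in the binding pocket.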

The present invention also provides a method for designing ligands for RTKs. It is well known in 
the art to use a screening method as described above to identify a test compound with promising fit, but 
then to use this test compound as a starting point to design a ligand with improved fit to the model. Such 
techniques are known as "structure-based ligand design" (See Kuntz et al., 1994, Acc. Chem. Res. 27:117;
Guida, 1994, Current Opinion in Struc. Biol. 4: 777; and Colman, 1994, Current Opinion in Struc. Biol. 4:
868, for reviews of structure-based drug design and identification; and Kuntz et al., 1982, J. Mol. Biol.
162:269; Kuntz et al., 1994, Acc. Chem. Res. 27:117; Meng et al., 1992, J. Comp. Chem. 13: 505; Bohm,
1994, J. Comp. Aided Molec. Design 8: 623 for methods of structure-based modulator design).

Examples of computer programs that may be used for structure-based ligand design are CAVEAT
(Bartlett et al., 1989, in "Chemical and Biological Problems in Molecular Recognition", Roberts, S.M.;
Ley, S.V.; Campbell, N.M. eds; Royal Society of Chemistry: Cambridge, pp 182-196); FLOG (Miller et
al., 1994, J. Comp. Aided Molec. Design 8:153); PRO Modulator (Clark et al., 1995 J. Comp. Aided
Molec. Design 9:13); MCSS (Miranker and Karplus, 1991, Proteins: Structure, Function, and Genetics
8:195); and GRID (Goodford, 1985, J. Med. Chem. 28:849).

The method may comprise the following steps:

(i) docking a model of a test compound with a model of a binding pocket;

(ii) identifying one or more groups on the test compound which may be modified to improve
their fit in the binding pocket;

(iii) replacing one or more identified groups to produce a modified test compound model; and

(iv) docking the modified test compound model with the model of the binding pocket.

Evaluation of fit may comprise the following steps:

(a) mapping chemical features of a test compound such as by hydrogen bond donors or
acceptors, hydrophobic/lipophilic sites, positively ionizable sites, or negatively ionizable
sites; and

(b) adding geometric constraints to selected mapped features.

The fit of the modified test compound may then be evaluated using the same criteria.

The chemical modification of a group may either enhance or reduce hydrogen bonding
interaction, charge interaction, hydrophobic interaction, van der Waals interaction or dipole interaction
between the test compound and the key amino acid residue(s) of the binding pocket. Preferably the group
modifications involve the addition, removal or replacement of substituents on the test compound such
that the substituents are positioned to collide or to bind preferentially with one or more amino acid residues
that correspond to the key amino acid residues of the binding pocket.

If a modified test compound model has an improved fit, then it may bind to a binding pocket and
be considered to be a "ligand". Rational modification of groups may be made with the aid of libraries of
molecular fragments which may be screened for their capacity to fit into the available space and to interact
with the appropriate atoms. Databases of computer representations of libraries of chemical groups are
available commercially for this purpose.

The test compound may also be modified "in situ" (i.e. once docked into the potential binding
pocket), enabling immediate evaluation of the effect of replacing selected groups. The computer
representation of the test compound may be modified by deleting a chemical group or groups, or by adding
a chemical group or groups. After each modification to a compound, the atoms of the modified compound
and potential binding pocket can be shifted in conformation and the distance between the modulator and
the binding pocket atoms may be scored on the basis of geometric fit and favourable complementary
interactions between the molecules. This technique is described in detail in Molecular Simulations User
Manual, 1995 in LUDI.

Examples of ligand building and/or searching computer programs include programs in the 
Molecular Simulations Package (Catalyst), ISIS/HOST, ISIS/BASE, and ISIS/DRAW (Molecular Designs 
Limited), and UNITY (Tripos Associates).

The "starting point" for rational ligand design may be a known ligand for the enzyme. For 
example, in order to identify potential modulators of an RTK, in particular an Eph receptor, a logical 
approach would be to start with a known ligand (for example a nucleotide or known kinase inhibitors) to 
produce a molecule which mimics the binding of the ligand. Such a molecule may, for example, act as a 
competitive inhibitor for the true ligand, or may bind so strongly that the interaction (and inhibition) is
effectively irreversible. 

Such a method may comprise the following steps: 

(i) generating a computer model of a binding pocket in complex with a ligand;

(ii) replacing one or more groups on the ligand model to produce a modified ligand; and

(iii) evaluating the fit of the modified ligand in the binding pocket.

The replacement groups could be selected and replaced using a compound construction program
which replaces computer representations of chemical groups with groups from a computer database, where
the representations of the compounds are defined by structural coordinates.

In an embodiment, a screening method is provided for identifying a ligand of an RTK, in
particular an Eph receptor, comprising the step of using the structural coordinates of a nucleotide or
component thereof, defined in relation to its spatial association with a binding pocket of the invention, to
generate a compound that is capable of associating with the binding pocket.

In an embodiment of the invention, a screening method is provided for identifying a ligand of an
RTK, in particular an Eph receptor, comprising the step of using the structural coordinates of adenosine,
adenine, or ATP listed in Table 3 to generate a compound for associating with a binding pocket of an RTK, in
particular an Eph receptor as described herein. The following steps are employed in a particular method of
the invention: (a) generating a computer representation of adenosine, adenine, or ATP, defined by its
structural coordinates listed in Table 3; and (b) searching for molecules in a database that are structurally or
chemically similar to the defined adenosine, adenine, or ATP, using a searching computer program, or
replacing portions of the adenosine, adenine, or ATP with similar chemical structures from a database using
a compound building computer program.

A screening method is provided for identifying a ligand of an RTK, in particular an Eph receptor,
comprising the step of using the structural coordinates of a binding pocket comprising a juxtamembrane
region or part thereof listed in Table 3 to generate a compound for associating with a kinase domain of an
RTK, in particular an Eph receptor. The following steps are employed in a particular method of the
invention: (a) generating a computer representation of a binding pocket comprising a juxtamembrane
region or part thereof defined by its structural coordinates listed in Table 3; and (b) searching for
molecules in a database that are structurally or chemically similar to the defined binding pocket using a
searching computer program, or replacing portions of the binding pocket with structures from a database
using a compound building computer program.

The screening methods of the present invention may be used to identify compounds or entities 
that associate with a molecule that associates with an RTK, in particular an Eph receptor (for example, a
nucleotide).

Compounds and entities (e.g. ligands) of RTKs, in particular Eph receptors, identified using the 
above-described methods may be prepared using methods described in standard reference sources utilized 
by those skilled in the art. For example, organic compounds may be prepared by organic synthetic methods
described in references such as March, 1994, Advanced Organic Chemistry: Reactions, Mechanisms, and
Structure, New York, McGraw Hill.

Test compounds and ligands which are identified using a crystal or model of the present invention 
can be screened in assays such as those well known in the art. Screening may be for example in vitro, in 
cell culture, and/or in vivo. Biological screening assays preferably centre on activity-based response
models, binding assays (which measure how well a compound binds to a binding pocket of a receptor), and
bacterial, yeast, and animal cell lines (which measure the biological effect of a compound in a cell). The
assays may be automated for high throughput screening in which large numbers of compounds can be
tested to identify compounds with the desired activity. The biological assay may also be an assay for the
binding activity of a compound that selectively binds to the binding pocket compared to other receptors.

LIGANDS/COMPOUNDS IDENTIFIED BY SCREENING METHODS 

The present invention provides a ligand or compound identified by a screening method of the 
present invention. A ligand or compound may have been designed rationally by using a model according
to the present invention. A ligand or compound identified using the screening methods of the invention
may specifically associate with a target compound, or part thereof (e.g. a binding pocket). In the present
invention the target compound may be the RTK (e.g. Eph receptor) or part thereof, or a molecule that is
capable of associating with the RTK or part thereof (for example a nucleotide). In an embodiment, the
ligand is capable of binding to phosphoregulatory sites of a binding pocket, in particular
phosphoregulatory sites of a juxtamembrane region or kinase domain. In another embodiment, the ligand is
capable of binding to the activation segment of a kinase domain of an Eph receptor.

A ligand or compound identified using a screening method of the invention may act as a
"modulator", i.e. a compound which affects the activity of an RTK, in particular an Eph receptor. A
modulator may reduce, enhance or alter the biological function of an RTK, in particular an Eph receptor.
For example, a modulator may modulate the capacity of the RTK to autophosphorylate. An alteration in
biological function may be characterised by a change in specificity. For example, a modulator may cause
the RTK to accept a different nucleotide, to phosphorylate a different amino acid residue, or to work with a
different metal cofactor. A modulator may dispose an RTK to favor the autoinhibited state or active state.
In order to exert its function, the modulator commonly binds to a binding pocket.

A "modulator" which is capable of reducing the biological function of the enzyme may also be
known as an inhibitor. Preferably an inhibitor reduces or blocks the capacity of the enzyme to
autophosphorylate. An inhibitor may promote the autoinhibition state of an RTK. The inhibitor may
mimic the binding of a nucleotide or substrate, for example, it may be a nucleotide or substrate analogue.
A nucleotide analogue may be designed by considering the interactions between the nucleotide and the
RTK (for example, by using information derivable from the crystal of the invention) and specifically
altering one or more groups (as described above).

The present invention also provides a method for modulating the activity of an RTK, in particular 
an Eph receptor, using a modulator according to the present invention. The invention also provides a
method for inhibiting autophosphorylation of an RTK, preferably an Eph receptor, by potentiating the
autoinhibition state of an RTK, or inhibiting the active state of the RTK. Inhibition of phosphorylation of an
RTK may decrease signaling by the RTK and inhibit cellular processes that may be involved in disease. It
would be possible to monitor receptor activity following such treatments by a number of methods known
in the art.

A modulator may be an agonist, partial agonist, partial inverse agonist or antagonist of an RTK.

As used herein, the term "agonist" means any ligand which is capable of binding to a binding
pocket and which is capable of increasing a proportion of the receptor that is in an active form, resulting in
an increased biological response. The term includes partial agonists and inverse agonists.

As used herein, the term "partial agonist" means an agonist that is unable to evoke the maximal
response of a biological system, even at a concentration sufficient to saturate the specific receptors.

As used herein, the term "partial inverse agonist" means an inverse agonist that evokes a submaximal
response to a biological system, even at a concentration sufficient to saturate the specific receptors. At
high concentrations, it will diminish the actions of a full inverse agonist.

As used herein, the term "antagonist" means any agent that reduces the action of another agent,
such as an agonist. The antagonist may act at the same site as the agonist (competitive antagonism). The
antagonistic action may result from a combination with the substance being antagonised (chemical
antagonism) or the production of an opposite effect through a different receptor (functional antagonism or
physiological antagonism) or as a consequence of competition for the binding site of an intermediate that
links receptor activation to the effect observed (indirect antagonism).

As used herein, the term "competitive antagonism" refers to the competition between an agonist 
and an antagonist for a binding pocket of a receptor that occurs when the binding of agonist and antagonist 
becomes mutually exclusive. This may be because the agonist and antagonist compete for the same 
binding sites or pockets, or combine with adjacent but overlapping sites. A third possibility is that 
different sites are involved but that they influence the receptor macromolecules in such a way that agonist
and antagonist molecules cannot be bound at the same time. If the agonist and antagonist form only short 
lived combinations with a binding pocket of a receptor so that equilibrium between agonist, antagonist and 
receptor is reached during the presence of the agonist, the antagonism will be surmountable over a wide 
range of concentrations. In contrast, some antagonists, when in close enough proximity to their binding 
site, may form a stable covalent bond with it and the antagonism becomes insurmountable when no spare
receptors remain. 
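
The surmountable case can be illustrated with the standard Gaddum relation for reversible competitive binding. This is a textbook sketch rather than a formula taken from this disclosure; it assumes simple mass-action binding at a single site, with $[A]$ and $[B]$ the agonist and antagonist concentrations, $K_A$ and $K_B$ their equilibrium dissociation constants, and $p_A$ the fraction of binding pockets occupied by agonist (all symbols are introduced here for illustration).

```latex
p_A = \frac{[A]/K_A}{1 + [A]/K_A + [B]/K_B}
```

For any fixed $[B]$, $p_A$ approaches 1 as $[A]$ increases, which is the sense in which the antagonism is surmountable. Under the same assumptions, the agonist concentration needed to reach a given occupancy is multiplied by the factor $1 + [B]/K_B$.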

As mentioned above, an identified ligand or compound may act as a ligand model (for example, a 
template) for the development of other compounds. A modulator may be a mimetic of a ligand. 

Like the test compound (see above), a modulator may be one of a variety of different sorts of
molecule. (See examples herein.) A modulator may be an endogenous physiological compound, or it may
be a natural or synthetic compound. The modulators of the present invention may be natural or synthetic. 
The term "modulator" also refers to a chemically modified ligand or compound. 

The technique suitable for preparing a modulator will depend on its chemical nature. For example, 
peptides can be synthesized by solid phase techniques (Roberge JY et al (1995) Science 269: 202-204) and 
automated synthesis may be achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin
Elmer) in accordance with the instructions provided by the manufacturer. Once cleaved from the resin, the
peptide may be purified by preparative high performance liquid chromatography (e.g., Creighton (1983)
Proteins Structures and Molecular Principles, WH Freeman and Co, New York NY). The composition of
the synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the Edman
degradation procedure; Creighton, supra).

If a modulator is a nucleotide, or a polypeptide expressable therefrom, it may be synthesized, in 
whole or in part, using chemical methods well known in the art (see Caruthers MH et al (1980) Nuc Acids 
Res Symp Ser 215-23, Horn T et al (1980) Nuc Acids Res Symp Ser 225-232), or it may be prepared using
recombinant techniques well known in the art.

Organic compounds may be prepared by organic synthetic methods described in references such 
as March, 1994, Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, New York, 
McGraw Hill. 

The invention also relates to classes of modulators of RTKs based on the structure and shape of a
nucleotide, or component thereof, or a substrate or component thereof, defined in relation to the
nucleotide's or substrate's spatial association with a crystal structure of the invention or part thereof.

A class of modulators may comprise a compound containing a structure of adenine, adenosine, 
ribose, pyrophosphate, or ATP, and having one or more, preferably all, of the structural coordinates of 
adenine, adenosine, ribose, pyrophosphate, or ATP of Table 4. Functional groups in the adenine,
adenosine, ribose, pyrophosphate, or ATP modulators may be substituted with, for example, alkyl, alkoxy, 
hydroxyl, aryl, cycloalkyl, alkenyl, alkynyl, thiol, thioalkyl, thioaryl, amino, or halo, or they may be 
modified using techniques known in the art. 

Another class of modulators defined by the invention are compounds comprising an adenine 
triphosphate group having the structural coordinates of adenine triphosphate in the active site binding
pocket of an Eph receptor. 

The invention contemplates all optical isomers and racemic forms of the modulators of the
invention. 

PHARMACEUTICAL COMPOSITION 

The present invention also provides for the use of a modulator according to the invention, in the
manufacture of a medicament to treat and/or prevent a disease in a mammalian patient. There is also
provided a pharmaceutical composition comprising such a modulator and a method of treating and/or
preventing a disease comprising the step of administering such a modulator or pharmaceutical composition
to a subject, preferably a mammalian patient.

The pharmaceutical compositions may be for human or animal usage in human and veterinary
medicine and will typically comprise a pharmaceutically acceptable carrier, diluent, excipient, adjuvant or
combination thereof. 

Acceptable carriers or diluents for therapeutic use are well known in the pharmaceutical art, and 
are described, for example, in Remington's Pharmaceutical Sciences, Mack Publishing Co. (A. R. 
Gennaro edit. 1985). The choice of pharmaceutical carrier, excipient or diluent can be made with regard
to the intended route of administration and standard pharmaceutical practice. The pharmaceutical
compositions may comprise as - or in addition to - the carrier, excipient or diluent any suitable binder(s),
lubricant(s), suspending agent(s), coating agent(s), or solubilising agent(s).

Preservatives, stabilizers, dyes and even flavouring agents may be provided in the pharmaceutical
composition. Examples of preservatives include sodium benzoate, sorbic acid and esters of p- 
hydroxybenzoic acid. Antioxidants and suspending agents may also be used. 

The routes for administration (delivery) include, but are not limited to, one or more of: oral (e.g. as
a tablet, capsule, or as an ingestable solution), topical, mucosal (e.g. as a nasal spray or aerosol for
inhalation), nasal, parenteral (e.g. by an injectable form), gastrointestinal, intraspinal, intraperitoneal, 
intramuscular, intravenous, intrauterine, intraocular, intradermal, intracranial, intratracheal, intravaginal, 
intracerebroventricular, intracerebral, subcutaneous, ophthalmic (including intravitreal or intracameral), 
transdermal, rectal, buccal, vaginal, epidural, sublingual. 

Where the pharmaceutical composition is to be delivered mucosally through the gastrointestinal
mucosa, it should be able to remain stable during transit through the gastrointestinal tract; for example, it
should be resistant to proteolytic degradation, stable at acid pH and resistant to the detergent effects of bile.

Where appropriate, the pharmaceutical compositions can be administered by inhalation, in the 
form of a suppository or pessary, topically in the form of a lotion, gel, hydrogel, solution, cream, ointment
or dusting powder, by use of a skin patch, orally in the form of tablets containing excipients such as starch
or lactose or chalk, or in capsules or ovules either alone or in admixture with excipients, or in the form of
elixirs, solutions or suspensions containing flavouring or colouring agents, or they can be injected
parenterally, for example intravenously, intramuscularly or subcutaneously. For parenteral administration,
the compositions may be best used in the form of a sterile aqueous solution which may contain other
substances, for example enough salts or monosaccharides to make the solution isotonic with blood. The
aqueous solutions should be suitably buffered (preferably to a pH of from 3 to 9), if necessary. The
preparation of suitable parenteral formulations under sterile conditions is readily accomplished by standard
pharmaceutical techniques well-known to those skilled in the art.

If the agent of the present invention is administered parenterally, then examples of such
administration include one or more of: intravenously, intra-arterially, intraperitoneally, intrathecally,
intraventricularly, intraurethrally, intrasternally, intracranially, intramuscularly or subcutaneously
administering the agent; and/or by using infusion techniques.

For buccal or sublingual administration the compositions may be administered in the form of
tablets or lozenges which can be formulated in a conventional manner. 

The tablets may contain excipients such as microcrystalline cellulose, lactose, sodium citrate,
calcium carbonate, dibasic calcium phosphate and glycine, disintegrants such as starch (preferably corn,
potato or tapioca starch), sodium starch glycollate, croscarmellose sodium and certain complex silicates,
and granulation binders such as polyvinylpyrrolidone, hydroxypropylmethylcellulose (HPMC),
hydroxypropylcellulose (HPC), sucrose, gelatin and acacia. Additionally, lubricating agents such as
magnesium stearate, stearic acid, glyceryl behenate and talc may be included.

Solid compositions of a similar type may also be employed as fillers in gelatin capsules. 
Preferred excipients in this regard include lactose, starch, cellulose, milk sugar or high molecular weight 
polyethylene glycols. For aqueous suspensions and/or elixirs, the agent may be combined with various
sweetening or flavouring agents, colouring matter or dyes, with emulsifying and/or suspending agents and
with diluents such as water, ethanol, propylene glycol and glycerin, and combinations thereof.

As indicated, a therapeutic agent (e.g. modulator) of the present invention can be administered 
intranasally or by inhalation and is conveniently delivered in the form of a dry powder inhaler or an 
aerosol spray presentation from a pressurised container, pump, spray or nebuliser with the use of a suitable
propellant, e.g. dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, a
hydrofluoroalkane such as 1,1,1,2-tetrafluoroethane (HFA 134A™) or 1,1,1,2,3,3,3-heptafluoropropane
(HFA 227EA™), carbon dioxide or other suitable gas. In the case of a pressurised aerosol, the dosage unit
may be determined by providing a valve to deliver a metered amount. The pressurised container, pump,
spray or nebuliser may contain a solution or suspension of the active compound, e.g. using a mixture of
ethanol and the propellant as the solvent, which may additionally contain a lubricant, e.g. sorbitan trioleate.
Capsules and cartridges (made, for example, from gelatin) for use in an inhaler or insufflator may be 
formulated to contain a powder mix of the agent and a suitable powder base such as lactose or starch. 

Therapeutic administration of polypeptide modulators may also be accomplished using gene
therapy. A nucleic acid including a promoter operatively linked to a sequence encoding a heterologous
polypeptide may be used to produce high-level expression of the polypeptide in cells transfected with the
nucleic acid. DNA or isolated nucleic acids may be introduced into cells of a subject by conventional
nucleic acid delivery systems. Suitable delivery systems include liposomes, naked DNA, receptor-mediated
delivery systems, and viral vectors such as retroviruses, herpes viruses, and adenoviruses.

APPLICATIONS

The invention provides a method for inhibiting kinase activity of an RTK comprising maintaining
the RTK or a binding pocket thereof involved in regulating the kinase domain in an autoinhibited state, or
potentiating an autoinhibited state for the RTK or binding pocket thereof involved in regulating the kinase
domain. An autoinhibited state may be maintained or potentiated by inhibiting phosphorylation of
phosphoregulatory sites of the juxtamembrane segment and/or kinase domain (e.g. activation segment).
Inhibition may be accomplished using modulators, or by altering the structure of a binding pocket of the RTK
comprising the phosphoregulatory sites, to prevent phosphorylation of the sites.

The invention contemplates a method for altering the stability of an autoinhibited state of an RTK 
comprising phosphorylating phosphoregulatory sites of a juxtamembrane region of the RTK.

In an aspect the invention relates to a method for changing an RTK from an autoinhibited state to
an active state comprising phosphorylating phosphoregulatory sites of a juxtamembrane region of the
RTK.

In another aspect the invention provides a method for activating kinase activity of an RTK 
comprising phosphorylating phosphoregulatory sites of a juxtamembrane region and kinase domain (e.g. 
activation segment) of the RTK.

The invention further provides a method of treating a mammal, the method comprising 
administering to a mammal a modulator or pharmaceutical composition of the present invention. 

In particular, the invention contemplates a method of treating or preventing a condition or disease 
associated with an RTK in a cellular organism, comprising: 
(c) administering a modulator of the invention in an acceptable pharmaceutical preparation; and

(d) activating or inhibiting the RTK to treat or prevent the disease. 

In an aspect the invention provides a method for treating or preventing a condition or disease 
involving increased RTK activity comprising maintaining the RTK or a binding pocket thereof involved in 
regulating the kinase domain of the RTK in an autoinhibited state. An autoinhibited state may be
maintained as described herein. In an embodiment the condition or disease is cancer. 

The invention provides for the use of a modulator identified by the methods of the invention in 
the preparation of a medicament to treat or prevent a disease in a cellular organism. Use of modulators of 
the invention to manufacture a medicament is also provided.

Typically, a physician will determine the actual dosage of a modulator or pharmaceutical
composition of the invention that will be most suitable for an individual subject and it will vary with the
age, weight and response of the particular patient and severity of the condition. There can, of course, be 
individual instances where higher or lower dosage ranges are merited. 

The specific dose level and frequency of dosage for any particular patient may be varied and will 
depend upon a variety of factors including the activity of the specific compound employed, the metabolic
stability and length of action of that compound, the age, body weight, general health, sex, diet, mode and
time of administration, rate of excretion, drug combination, the severity of the particular condition, and the
individual undergoing therapy. By way of example, the pharmaceutical composition of the present
invention may be administered in accordance with a regimen of 1 to 10 times per day, such as once or
twice per day.

For oral and parenteral administration to human patients, the daily dosage level of the agent may 
be in single or divided doses. 

The modulators and compositions of the invention may be useful in the prevention and treatment 
of conditions involving aberrant RTKs. 

Conditions which may be prevented or treated in accordance with the invention include but are
not limited to lymphoproliferative conditions, malignant and pre-malignant conditions, arthritis,
inflammation, and autoimmune disorders. Malignant and pre-malignant conditions may include solid
tumors, B cell lymphomas, chronic lymphocytic leukemia, chronic myelogenous leukemia, prostate
hypertrophy, Hirschsprung disease, glioblastoma, breast and ovarian cancer, adenocarcinoma of the
salivary gland, promyelocytic leukemia, prostate cancer, multiple endocrine neoplasia type IIA and IIB,
medullary thyroid carcinoma, papillary carcinoma, papillary renal carcinoma, hepatocellular carcinoma,
gastrointestinal stromal tumors, sporadic mastocytosis, acute myeloid leukemia, large cell lymphoma or
Alk lymphoma, chronic myeloid leukemia, hematological/solid tumors, papillary thyroid carcinoma, stem
cell leukemia/lymphoma syndrome, acute myelogenous leukemia, osteosarcoma, multiple myeloma,
preneoplastic liver foci, and resistance to chemotherapy. Diseases associated with increased cell survival,
or the inhibition of apoptosis, include cancers (e.g. follicular lymphomas, carcinomas with p53 mutations,
hormone-dependent tumors such as breast cancer, prostate cancer, Kaposi's sarcoma and ovarian cancer);
autoimmune disorders (such as lupus erythematosus, immune-related glomerulonephritis and rheumatoid
arthritis) and viral infections (such as herpes viruses, pox viruses, and adenoviruses); inflammation, graft
vs. host disease, acute graft rejection and chronic graft rejection.

Eph receptors and ephrins mediate contact-dependent repulsive guidance of migrating cells and 
axons in culture and in vivo. Many Eph family members are prominently expressed in the developing 
nervous system, and ephrin stimulation of growing primary axons in vitro results in axonal retraction or
repulsion, characterized by a collapse of actin-rich growth cone structures at the leading edge of the cell. 
Mice bearing homozygous null mutations in EphA8 or in both EphB2 and EphB3 exhibit abnormal 
migration of axon tracts in the brain. Ephrin-induced retraction of exploratory actin filopodia has also been 
described in vivo in migrating Eph receptor-expressing neural crest cells. 

The Eph receptors and ephrins have also been implicated in cell sorting and boundary formation.
Eph-receptor signaling is able to modulate both cell-cell and cell-substrate attachment. Bidirectional Eph
receptor-ephrin signaling is important for the formation of boundaries between rhombomeres of the hind
brain. These cellular responses to Eph receptor stimulation indicate that they may regulate signaling events
which control cytoskeletal architecture and cell adhesion functions.

Therefore, modulators of Eph receptors may be used to modulate axonogenesis, nerve cell
interactions and regeneration, to treat conditions such as neurodegenerative diseases and conditions
involving trauma and injury to the nervous system, for example Alzheimer's disease, Parkinson's disease,
Huntington's disease, demyelinating diseases, such as multiple sclerosis, amyotrophic lateral sclerosis,
bacterial and viral infections of the nervous system, deficiency diseases, such as Wernicke's disease and
nutritional polyneuropathy, progressive supranuclear palsy, Shy-Drager syndrome, multisystem
degeneration and olivopontocerebellar atrophy, peripheral nerve damage, trauma and ischemia resulting
from stroke.

Therapeutic efficacy and toxicity of compositions and modulators of the invention may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as 
by calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose
lethal to 50% of the population) statistics. The therapeutic index is the dose ratio of therapeutic to toxic
effects and it can be expressed as the ED50/LD50 ratio. Pharmaceutical compositions that exhibit large
therapeutic indices are preferred. 

The invention will now be illustrated by the following non-limiting example: 
EXAMPLE:

The following methods were used in the investigation described in the example: 

Methods 

Protein Expression and Purification 

Mutagenesis of the juxtamembrane tyrosines (Y604/610F) of murine EphB2 was performed using
a PCR-based approach. The amplified cDNA sequence, corresponding to the receptor's juxtamembrane
region and kinase domain (residues 595-906), was cloned into pGEX-4T-1 (Pharmacia). The glutathione
S-transferase (GST)-EphB2 construct was transformed into Escherichia coli B834 cells and the cells were
grown in minimal media supplemented with selenomethionine, with overnight induction at 15°C and 0.15
mM IPTG (isopropyl-β-D-thiogalactopyranoside, BioShop). Cells were lysed by homogenization and
sonication in 25 mM HEPES (pH 7.5), 50 mM NaCl, 20% glycerol, 2 mM DTT, 2 mM
phenylmethylsulphonyl fluoride. Purification of the selenomethionyl derivative of EphB2 was performed as
previously described (Binns et al., 2000), with the exception that the buffer used for gel filtration (buffer C)
was 10 mM HEPES (pH 7.5), 50 mM NaCl, 1 mM DTT.

Crystallization, Data Collection, and Structure Determination 

Hanging drops containing 1 µl of 12.5 mg/ml protein in buffer C were mixed with equal volumes
of reservoir buffer containing 0.1 M HEPES (pH 7.0), 0.2 M magnesium chloride, 10% (w/v) PEG 4000,
10% (v/v) isopropanol, and 15% (v/v) ethylene glycol. Rod-like crystals were obtained overnight at 28°C
after streak seeding with smaller crystals obtained initially. The crystals belong to primitive space group
P2₁ (a = 47.86 Å, b = 98.09 Å, c = 68.18 Å, α = γ = 90°, β = 104.97°), with two molecules of EphB2 in the
asymmetric unit. Crystals were flash frozen by immersion in liquid nitrogen. A MAD experiment was
performed on a frozen crystal at APS beamline BM 14-D (λ1 = 0.9790 Å, λ2 = 0.9788 Å, λ3 = 0.9770 Å)
using a Quantum 4 ADSC CCD detector. Data processing and reduction were carried out with the HKL
program suite (Otwinowski and Minor, 1997). The programs SHARP (La Fortelle and Bricogne, 1997)
and SnB (Miller et al., 1994) were used in combination to locate and subsequently refine positions for 22
of the possible 30 Se sites. Following density modification with Solomon (Abrahams and Leslie, 1996), a
partial model was generated using O (Jones et al., 1991) and refined using CNS (Brunger et al., 1998)
(R-factors > 40%). Consequently, crystals of EphB2 in complex with 2 µM AMP-PNP were grown as
described above (space group P1, a = 47.05 Å, b = 57.62 Å, c = 67.74 Å, α = 112.95°, β = 103.17°, γ =
91.58°), with two molecules per asymmetric unit. Diffraction data were collected to 1.9 Å at APS beamline
BM 14-C (λ = 1.00 Å) using a Quantum 4 ADSC CCD detector and processed with the HKL program
suite. Molecular replacement solutions were determined with AMoRe (Navaza, 1994; CCP4, 1994), using
one monomer of the P2₁-derived model as a search molecule. The two AMoRe solutions, which
correspond to the two EphB2 molecules in the asymmetric unit, refined readily in CNS. With minimal
modification to the starting model, the model has been refined to a working R value of 24.1% and a free R
value of 27.7%. As defined in PROCHECK (Laskowski et al., 1993), 90.8% of protein residues are in the
most favored regions of the Ramachandran plot, with none in the disallowed regions. Pertinent statistics
for data collection and refinement are shown in Table 1.
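
As a consistency check on the two crystal forms, the cell constants quoted above can be turned into unit cell volumes. The following is a minimal sketch, not part of the original methods; the function names are illustrative, and the only inputs are the cell dimensions from the text together with the standard facts that P2₁ has two asymmetric units per cell and P1 has one.

```python
import math

def unit_cell_volume(a, b, c, alpha_deg, beta_deg, gamma_deg):
    """Volume in cubic angstroms of a general (triclinic) unit cell."""
    ca = math.cos(math.radians(alpha_deg))
    cb = math.cos(math.radians(beta_deg))
    cg = math.cos(math.radians(gamma_deg))
    return a * b * c * math.sqrt(1.0 - ca**2 - cb**2 - cg**2 + 2.0 * ca * cb * cg)

def volume_per_molecule(cell_volume, asu_per_cell, molecules_per_asu):
    """Cell volume divided by the number of molecules in the cell."""
    return cell_volume / (asu_per_cell * molecules_per_asu)

# Cell constants as quoted in the text.
v_p21 = unit_cell_volume(47.86, 98.09, 68.18, 90.0, 104.97, 90.0)
v_p1 = unit_cell_volume(47.05, 57.62, 67.74, 112.95, 103.17, 91.58)

# P2(1): two asymmetric units per cell; P1: one. Both hold two molecules each.
per_mol_p21 = volume_per_molecule(v_p21, asu_per_cell=2, molecules_per_asu=2)
per_mol_p1 = volume_per_molecule(v_p1, asu_per_cell=1, molecules_per_asu=2)

print(f"P2(1) cell: {v_p21:.0f} A^3, {per_mol_p21:.0f} A^3 per molecule")
print(f"P1 cell:    {v_p1:.0f} A^3, {per_mol_p1:.0f} A^3 per molecule")
```

The two forms should give similar volumes per molecule if the quoted cell constants and molecule counts are mutually consistent. Dividing by a molecular weight would give a Matthews coefficient, but no molecular weight is stated in the text, so that step is left out.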

Mutagenesis

The cDNA sequence of the juxtamembrane region and kinase domain of murine EphA4 (amino
acids 591-896), corresponding to residues 599-906 of murine EphB2, was cloned into pGEX-4T-2
(Pharmacia). The murine EphB2 numbering scheme was employed, and the corresponding EphA4 residue
numbers are listed in parentheses. Using a PCR-based approach, Tyr 604 (Tyr 596) and Tyr 610 (Tyr 602)
were mutated to phenylalanine. The following site-directed mutants were then generated using this doubly
mutated construct: (1) ΔJX; deletion of 599-621 (591-613), (2) ΔJX1; deletion of 599-606 (591-598), (3)
ΔJX1+2; deletion of 599-610 (591-602), (4) Pro607Gly (Pro599Gly), (5) Phe608Asp (Phe600Asp), (6)
Phe620Asp (Phe612Asp), (7) Ser680Trp (Ser672Trp), (8) ΔJX1+2 plus Phe620Asp, and (9)
Tyr604/610Glu (Tyr596/602Glu). The GST-EphA4 constructs were transformed into E. coli BL21 codon
plus cells and grown in LB supplemented with ampicillin, with overnight induction at 15°C, 0.15 mM
IPTG. Purification was performed as described for EphB2. The mutations Tyr604Phe, Tyr610Phe,
Pro607Gly, Phe620Asp, Ser680Trp, Gln684Trp, deletion of 599-606 (ΔJX1), deletion of 599-610
(ΔJX1+2), and deletion of 600-621 (ΔJXlau) in murine EphB2 were generated by site-directed
mutagenesis using overlapping oligonucleotide primers containing the above indicated point mutations or
deletions. All mutations were confirmed by DNA sequencing.
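
Because every EphA4 mutant above is named in EphB2 numbering, it can help to make the conversion explicit. The sketch below is illustrative only: the constant offset of 8 is inferred from the residue pairs quoted in the text (for example Tyr 604 and Tyr 596) and is not stated as such in the source. The fragment end points (906 versus 896) differ by 10, so the offset should be assumed to hold only for the juxtamembrane and N-terminal lobe positions listed here.

```python
# Offset inferred from the residue pairs quoted in the text; an assumption,
# valid only for the positions listed below.
EPHB2_TO_EPHA4_OFFSET = 8

def ephb2_to_epha4(ephb2_residue: int) -> int:
    """Convert a murine EphB2 residue number to the murine EphA4 number."""
    return ephb2_residue - EPHB2_TO_EPHA4_OFFSET

# EphB2 -> EphA4 pairs exactly as given in parentheses in the text.
PAIRS_FROM_TEXT = {
    599: 591, 604: 596, 606: 598, 607: 599, 608: 600,
    610: 602, 620: 612, 621: 613, 680: 672,
}

# Collect any quoted pair that does not follow the inferred offset.
mismatches = {
    b2: a4 for b2, a4 in PAIRS_FROM_TEXT.items() if ephb2_to_epha4(b2) != a4
}
print("pairs inconsistent with the offset:", mismatches)
```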
Western Blotting 

GST-EphA4 proteins expressed in E. coli (BL21 codon plus), and EphB2 proteins transiently
expressed in COS-1 cells, were harvested as previously described (Binns et al., 2000; Holland et al., 1997).
Proteins were resolved using 12% denaturing polyacrylamide gel electrophoresis (PAGE), transferred onto
a polyvinylidene difluoride membrane (Millipore), blotted with anti-pTyr (Upstate Biotechnology),
anti-GST (Santa Cruz Laboratories), or anti-EphB2 antibodies (Holland et al., 1997), and visualized using
enhanced chemiluminescence (ECL Plus; Amersham).

In vitro Kinase Reactions

In vitro kinase reactions using GST fusion EphA4 proteins bound to glutathione sepharose or
immunoprecipitated EphB2 proteins transiently expressed in COS-1 cells were performed with 5 µg and 2
µg of acid-denatured enolase, respectively, and 5 µCi of [γ-32P]ATP at room temperature as previously
described (Binns et al., 2000).

Spectrophotometric coupling assay

Kinetic analysis of the bacterially expressed EphA4 proteins was performed using a coupled in vitro
spectrophotometric kinase assay in which production of ADP is coupled to the oxidation of NADH through
pyruvate kinase and lactic dehydrogenase (Barker et al., 1995; Binns et al., 2000). The 100-µl reaction
volume contained 1 U lactic dehydrogenase, 1 U pyruvate kinase, 1 mM phosphoenolpyruvate, 0.2 mM
NADH, and 0.5 mM ATP (in 20 mM MgCl2, 0.1 mM DTT, 60 mM HEPES [pH 7.5], 20 µg/mL bovine
serum albumin). Wild type and mutant EphA4 activity was measured by monitoring absorbance at 340 nm
(Varian UV-Visible spectrophotometer) for 90 minutes at fixed concentrations of enzyme (0.5 µM) and
S-1 synthetic peptide (1 mM; GEEIYGEFD, amide at carboxy terminus). For accuracy,
protein concentrations were determined by UV spectrometry at 280 nm using molar extinction coefficients.

(Andersson, 1998; Collaborative Computational Project, 1994).
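
In a coupled assay of this kind, each ADP produced leads to the oxidation of one NADH, so the slope of the 340 nm trace can be converted to a kinase rate. The sketch below illustrates that conversion under stated assumptions. The NADH extinction coefficient (6220 M^-1 cm^-1 at 340 nm) is the commonly used literature value, the 1 cm path length and the example slope are hypothetical, and none of the function names come from the source.

```python
NADH_EXTINCTION_M_CM = 6220.0  # M^-1 cm^-1 at 340 nm (standard literature value)

def adp_rate_uM_per_min(delta_a340_per_min, path_length_cm=1.0):
    """Convert the A340 slope (negative as NADH is consumed) to ADP formed, in uM/min.

    Assumes 1:1 coupling of ADP production to NADH oxidation (Beer-Lambert law).
    """
    molar_per_min = -delta_a340_per_min / (NADH_EXTINCTION_M_CM * path_length_cm)
    return molar_per_min * 1e6

def turnover_per_min(rate_uM_per_min, enzyme_uM=0.5):
    """Rate per enzyme molecule, using the 0.5 uM enzyme concentration from the text."""
    return rate_uM_per_min / enzyme_uM

# Hypothetical slope, chosen only to illustrate the arithmetic.
example_slope = -0.0311  # absorbance units per minute
rate = adp_rate_uM_per_min(example_slope)
print(f"rate: {rate:.2f} uM/min, turnover: {turnover_per_min(rate):.1f} per min")
```

With a lag phase, as described for the dephosphorylated enzyme, the slope would be taken from the linear portion of the trace after the lag rather than from time zero.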
Results and Discussions 
Structure Determination 

Since the expression of active EphB2 polypeptides in E. coli is toxic, efforts were focused on the
catalytically repressed Tyr 604/610 Phe double mutant. For the purposes of discussion, these sites are
referred to as Tyr/Phe 604 and Tyr/Phe 610. A cytoplasmic fragment (residues 595 to 906) of the murine
EphB2 RTK consisting of the latter half of the juxtamembrane region and the entire catalytic domain was
expressed as a GST fusion in E. coli and purified to homogeneity (see Methods). The predicted boundaries
of the juxtamembrane region are residues 573-620, while those of the kinase and SAM domains are
residues 621-892 and residues 919-994, respectively. Protein crystals of two different space groups were
grown and the EphB2 structure was determined using a combination of seleno-methionine
multiwavelength anomalous dispersion (SeMet MAD) and molecular replacement (MR) methods (see
Methods). The EphB2 crystal structure reported here corresponds to the juxtamembrane-catalytic domain
fragment in complex with AMP-PNP (β,γ-imidoadenosine-5'-triphosphate). Overall, the EphB2 structure
is well ordered except for the first seven and last six amino acid residues, kinase domain residues 651 to
653 connecting β-strands 2 and 3 of the N-terminal catalytic lobe, and residues 774 to 796 corresponding
to the kinase activation segment within the C-terminal lobe. Only the adenine ring of AMP-PNP is
ordered in experimental and model based electron density maps, and hence the sugar and phosphate groups
have not been modeled. Data collection and refinement statistics are listed in Table 1 and a representative
alignment of the EphB2 receptor and other protein kinase family members is provided in Figure 1.
Overall description of the autoinhibited structure 

The structure of the catalytic domain of EphB2 conforms to that generally observed for protein 
kinases, consisting of two lobes, a smaller N-terminal lobe and larger C-terminal lobe (Figure 2a,b).
Protein kinases are capable of a range of conformations owing to an inherent inter-lobe flexibility that
allows for both open and closed conformations. However, the catalytically competent conformation is 
generally a closed structure in which the two catalytic lobes clamp together to form an interfacial 
nucleotide binding site and catalytic cleft. Surprisingly, the autoinhibited EphB2 catalytic domain adopts 
a closed conformation that resembles an 'active' state. 

The N-terminal lobe of protein kinases consists minimally of a twisted five-strand β-sheet (denoted
β1 to β5, as first described for the cAMP-dependent protein kinase (cAPK)) and a single helix αC (Knighton
et al., 1991). The N-terminal lobe functions to assist in the binding and coordination of ATP for the
productive transfer of the γ-phosphate to a substrate oriented by the C-terminal lobe. In this regard,
β-strands 1 and 2 and the glycine-rich connecting segment (g-loop) form a flexible flap that interacts with the
adenine base, ribose sugar and the non-hydrolyzable phosphate groups of ATP. Furthermore, an invariant
salt bridge between a lysine side chain (sub-domain 2 in the protein kinase nomenclature of Hanks et al.,
1988) in β-strand 3 and a glutamic acid side chain (sub-domain 3) in helix αC coordinates the β-phosphate
of ATP. In the EphB2 crystal structure, all N-terminal lobe elements implicated in nucleotide binding are
well ordered and adopt a prototypical protein kinase arrangement. However, distortions in helix αC and the
g-loop arising from interactions with the juxtamembrane segment are evident.

The C-terminal lobe of protein kinases consists minimally of two β-strands (β7 and β8) and a
series of α-helices (αD to αI). Strands β7 and β8 locate to the cleft region between the N- and C-terminal
lobes where they contribute side chains that participate in catalysis and the binding of magnesium for the
coordination of ATP phosphate groups. In the EphB2 crystal structure, all lower lobe residues implicated
in catalysis and ATP coordination appear optimally oriented (Figure 3c). The activation segment, which is
also located in the large catalytic lobe, is disordered as in several other protein kinase structures in which
the activation segment is not phosphorylated (reviewed by Johnson et al., 1996). The remaining C-terminal
lobe elements, including α-helices αD to αI, are well ordered and adopt the prototypical protein kinase
configuration. Terminating the catalytic domain structure is a short helix αJ.

The EphB2 juxtamembrane region preceding the catalytic domain is highly ordered and adopts
an identical conformation in the four unique environments sampled in the two different crystal forms
studied. From the amino-terminus, the conformation consists of an extended strand segment Ex1, a
single-turn 3₁₀ helix αA', and a four-turn helix αB'. These elements associate intimately with helix αC of the
N-terminal catalytic lobe and also make limited interactions with the C-terminal lobe. As a consequence of
the association of the juxtamembrane segment with the N-terminal kinase lobe, significant curvature is
imposed on helix αC. This distortion couples directly to local distortions in other N-terminal lobe
elements, most critically the g-loop and the invariant lysine-glutamate salt bridge. Together the N-terminal
lobe distortions appear to impinge on catalytic function by adversely affecting the coordination of the
sugar and phosphate groups of the bound nucleotide.

With limited contacts to the lower lobe of the catalytic domain, the juxtamembrane segment also
sterically impedes the activation segment from adopting the productive conformation that typifies the
active state of protein-serine/threonine and tyrosine kinases. Together, the effects on nucleotide
coordination and the activation segment form the basis for autoinhibition of EphB2 by the juxtamembrane
segment.

Depending on the splice variant of EphB2, there are 29-45 juxtamembrane residues between the 
start of strand Ex1 (Lys 602) and the plasma membrane (Connor and Pasquale, 1995). This relatively
lengthy sequence makes it impossible to predict whether the autoinhibited structure observed here would
be oriented in a specific fashion with respect to the inner surface of the membrane. 
Detailed analysis of juxtamembrane structure 

The juxtamembrane strand segment Ex1, corresponding to amino acid residues Lys 602 to Ile
605, extends along the cleft region between the N- and C-terminal lobes (Figure 2c,d). The
phosphoregulatory residue Tyr/Phe 604 orients into a solvent-exposed hydrophobic pocket composed of
the side chains of Met 748 and Tyr 750 of the C-terminal kinase lobe, Ile 681 and Phe 685 from helix αC
and Pro 607 from the juxtamembrane helix αA'. This site has been termed 'switch region 1' since Tyr/Phe
604 appears well placed to influence the association of the juxtamembrane region with the catalytic
domain. Further stabilizing the interaction of strand Ex1 with the lower catalytic lobe are hydrogen bonds
between the amide group of Tyr/Phe 604 and the carbonyl group of Met 748 and between the side chain of
Gln 684 and the backbone amide and carbonyl groups of Ile 605.

Helix αA' is composed of a single rigid turn initiated by an Asp 606-Pro 607 sequence and
terminated by Thr 609. This helix appears stabilized by the conformational rigidity of Pro 607 and the
capping interactions involving the side chains of Asp 606 and Thr 609 with the free backbone amino group
and carbonyl groups of Phe 608 and Asp 606. A short linker and then a three-turn helix αB', initiated by
Asp 612-Pro 613 and extending to Phe 620, follow helix αA'. Helix αB' is also initiated by an Asp-Pro
sequence (residues 612 and 613) and Asp 612 makes similar capping interactions with the backbone amino
group and side chain of Asn 614. Helices αA' and αB' form an interface with the N-terminal lobe of the kinase
that centers on helix αC. Hydrophobic side chains projecting from αA' and αB' include Pro 607, Phe 608,
Pro 613, Val 617, Phe 620 and Ala 621. These residues associate intimately with Arg 673, Leu 676, and Ile
681 from helix αC and Leu 693 and Val 696 from β-strand 4. In addition, a hydrogen bond interaction
(2.9 Å) is observed between Asn 614 and Arg 672 (Figure 2c), and the small side chains at positions 616
(Ala), 677 (Ser) and 680 (Ser) facilitate the close packing of helices αA', αB' and αC.

Opposite to, but contiguous with, the site of association with helix αC, strand Ex1 and helices
αA' and αB' form an interface composed primarily of hydrophobic interactions. The side chain of the
phosphoregulatory residue Tyr/Phe 610 projects onto the surface of this site and appears well positioned to
exert an influence on the local juxtamembrane structure. This interface, termed 'switch region 2', is
composed of the side chains of Ile 605 from strand Ex1 and the side chains of Ala 616 and Phe 620 from
helix αB'.

Effect of the juxtamembrane engagement on the N-terminal lobe structure 

Comparison of the EphB2 crystal structure with that of the 'active' triply phosphorylated insulin
receptor tyrosine kinase (active IRK; Hubbard, 1997) indicates the mechanism by which the
juxtamembrane region of EphB2 inhibits the catalytic domain. Superposition of C-terminal kinase lobe
elements places the majority of N-terminal lobe elements into close correspondence (Figure 3a-d). A
distinguishing feature of the EphB2 structure is a 14° kink midway along helix αC centered at Glu 678.
This kink, which coincides with the site of association with the juxtamembrane elements Ex1, αA' and
αB', displaces the forward facing N-terminus of helix αC 6.8 Å upward and outward from the equivalent
position observed in IRK (Figure 3a,c). Stabilizing this kink internally are side chain/main chain
interactions involving Ser 677 and Ser 680.
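
A kink such as the one described for helix αC is usually reported as the angle between the axes of the two helix halves on either side of the kink. The sketch below shows only that final step and is not the procedure used in the source: the axis vectors are hypothetical inputs, which in practice would be fitted to the Cα atoms on each side of Glu 678.

```python
import math

def angle_between_deg(u, v):
    """Angle in degrees between two 3D direction vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Clamp to [-1, 1] to guard against rounding just outside the acos domain.
    cos_angle = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.degrees(math.acos(cos_angle))

# Hypothetical axes: the second is the first tilted by 14 degrees about x.
axis_c_terminal_half = (0.0, 0.0, 1.0)
tilt = math.radians(14.0)
axis_n_terminal_half = (0.0, math.sin(tilt), math.cos(tilt))

kink = angle_between_deg(axis_c_terminal_half, axis_n_terminal_half)
print(f"kink angle: {kink:.1f} degrees")
```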

The kink in helix αC places its forward projecting terminus in close proximity to β-strands 3, 4,
and 5, forming a tighter interface than that observed in active IRK (Figure 3b). Residues participating in
this interface include Arg 672, Phe 675, and Leu 676 from helix αC, Tyr 667 from the β3/αC linker and
Leu 663, Val 696, Thr 698, Val 703, and Ile 705 from the β-strands. Interestingly, tyrosine 667, which is
centrally positioned within this interface and is highly conserved amongst the Eph receptor family
members, has been identified as an in vivo site of phosphorylation (Kalo and Pasquale, 1999), suggesting a
possible phosphoregulatory role.

The close association of helix αC with β-strands 3, 4 and 5 is achieved with a local alteration to
the twist of the forward projecting termini of β-strands 1, 2 and 3 that leaves the bulk of the N-terminal
sheet structure unperturbed. The g-loop side chain Phe 640 plays a role in coupling the β-strand
movements to that of helix αC through a direct interaction with Phe 675. The altered twist of the β1, β2
and β3 strand termini displaces main chain atoms at the end of the g-loop (Glu 639 and Phe 640) by
approximately 3.3 Å. In addition, together with the kink in helix αC, the altered twist of the β-strands
displaces the invariant glutamate and lysine side chains by 2.4 and 2.1 Å, respectively, relative to their
positions in active IRK (Figure 3c). As a consequence, the ability of the catalytic domain to coordinate the
sugar and phosphate groups of bound nucleotide is compromised (Figure 3a-c). Since the domain closure
and the bulk of the N-terminal β-sheet structure are not perturbed, the adenine binding pocket is well formed
and indeed the adenine base of bound AMP-PNP is ordered and orients in a manner similar to that in the
crystal structure of active IRK.

Steric influence of the juxtamembrane region on the activation segment 
While the majority of interactions between the juxtamembrane segment and the catalytic domain
are directed towards the N-terminal lobe, strand Ex1 forms a limited set of interactions with the C-terminal
lobe that may serve a regulatory role. Superposition of EphB2 with active IRK illustrates how the side
chain of the phosphoregulatory residue Tyr/Phe 604 impedes the activation segment from adopting a
productive conformation (Figure 3d). In autoinhibited EphB2, the side chain of Tyr 750 adopts an alternate
conformation from that of the corresponding residue Phe 1128 in active IRK. This avoids a steric clash
with the side chain of Tyr/Phe 604. The alternate conformation of Tyr 750, in turn, impedes the activation
segment from adopting a path observed in active IRK due to a steric clash with Ser 776 (Thr 1154 in IRK).
Interestingly, the side chain conformation of Tyr 750 in EphB2, Tyr 382 in Src and Hck, and Phe 1128 in
IRK all correlate with their activation segments adopting non-productive conformations. This may be
indicative of a more general function in protein kinases for position 750 in regulating the conformation of
the activation segment.
The phosphoregulatory switch 

The ability to oscillate between catalytically active and repressed states in a regulated manner is
the key to the function of protein kinases as versatile molecular switches. In EphB2, EphA4, and most
likely Eph RTKs in general, the switch to an active state is coordinated by phosphorylation at highly
conserved sites within both the juxtamembrane region and the catalytic domain. The mechanism by which
phosphorylation at sites within the activation segment stimulates protein kinases is relatively well
understood (reviewed by Johnson et al., 1996) and by inference, phosphorylation of EphB2 at Tyr 788
likely promotes the ordering of the activation segment to a catalytically competent conformation.

In contrast, phosphorylation at Tyr/Phe 604 and 610 may serve to destabilize the juxtamembrane
structure and cause it to dissociate from the catalytic domain. This would allow for a return of the
N-terminal lobe to an undistorted active conformation.

The EphB2 crystal structure helps to explain how phosphorylation at each of the two
phosphoregulatory sites could destabilize the juxtamembrane structure and cause its release from the
catalytic domain. The environment around each of the two switch regions is hydrophobic, but solvent
exposed, and thus could accommodate either tyrosine or phenylalanine at positions 604 and 610 with little
or no reorganization of the juxtamembrane structure. However, substitution with phosphotyrosine appears
less tolerable due to steric and electrostatic clashes involving the bulky anionic phosphate group. In
'switch region 1', the phosphorylation of Tyr/Phe 604 would place a phosphate group within van der
Waals contact of Asp 606, Pro 607 and Ile 681. Furthermore, the side chain of Asp 606 dominates the
electrostatic environment around Tyr/Phe 604 such that the introduction of a phosphate group would
generate repulsive electrostatic forces (Figure 4). The electrostatic environment around 'switch region 2'
is also dominated by negatively charged amino acids, namely Asp 606, Glu 611, Asp 612, Glu 615, and
Glu 619. Thus, phosphorylation of Tyr 610 would also generate repulsive electrostatic forces, which are
likely essential for the expulsion of this residue from its binding pocket since a phosphate group could be
accommodated sterically.

Three other highly conserved tyrosine residues have been identified as in vivo phosphorylation
sites in EphB2 and EphB5, namely tyrosines 667, 744 and 750 (Figure 3c). Although their roles in
regulating Eph receptor kinase activity have not been probed by mutagenesis, all three sites appear well
positioned to influence the stability of the autoinhibited structure and hence Eph receptor activity (Figure
3). For example, phosphorylation of Tyr 667 could promote a catalytically competent state by
destabilizing the tight association of helix αC with β-strands 3, 4 and 5 observed in the autoinhibited state.
In addition, phosphorylation of Tyr 744 and Tyr 750, which line the cleft region through which the
juxtamembrane strand Ex1 navigates, could amplify the effect of phosphorylation at Tyr 604.
Function of the EphA4 juxtamembrane segment probed by mutagenesis 

Previously, a cytoplasmic fragment of the EphA4 receptor tyrosine kinase, consisting of the
juxtamembrane segment, the catalytic domain and the SAM domain, was shown to require
autophosphorylation for maximal activation (Binns et al., 2000). The importance of autophosphorylation
was revealed by a lag period at the start of in vitro kinase reactions employing the dephosphorylated form
of the EphA4 enzyme. This lag period was greatly reduced by pre-incubation of the EphA4 fragment with
ATP or by deletion of the entire juxtamembrane segment. In contrast, mutation to phenylalanine of either
Tyr 604 or Tyr 610 reduced the specific activity of the enzyme, while mutation of both sites in tandem
drastically impaired catalytic function (<10% relative to WT). These results are consistent with the
mechanism of autoinhibition suggested by the EphB2 crystal structure.

In order to test the crystallographic findings and to probe the regulation of Eph receptor catalytic
activity in more detail, additional site-directed mutations were generated in the full-length murine EphB2
receptor expressed in COS-1 cells and in a murine EphA4 receptor fragment expressed in bacteria,
corresponding in content to the EphB2 construct used for the structure determination. For the sake of
discussion, the murine EphB2 numbering scheme has been employed for all mutants and the corresponding
EphA4 residue numbers are listed in parentheses. Each mutation was generated in the catalytically
repressed Tyr604/610Phe double mutant background and was tested for its ability to restore catalytic
function. The mutations include a small N-terminal deletion of residues 595 to 606 (ΔJX1) encompassing
strand Ex1 and the first phosphoregulatory site, an intermediate N-terminal deletion of residues 599 to 610
(ΔJX1+2) that encompasses strand Ex1, the first phosphoregulatory site, helix αA', and the second
phosphoregulatory site, and a full juxtamembrane segment deletion of residues 599 to 621 (ΔJXall). In
addition, six separate point mutations were generated in both the juxtamembrane region and the kinase
domain (Pro607Gly, Phe608Asp, Phe620Asp, Tyr604/610Glu, Ser680Trp, Gln684Trp) that were predicted
to destabilize the interaction of the kinase domain with the juxtamembrane segment. Lastly, the ΔJX1+2
mutation was combined with the Phe620Asp mutation (ΔJX1+2 plus Phe620Asp) and the Ser680Trp
mutation was combined with the Gln684Trp mutation (Ser680Trp/Gln684Trp). The Tyr604/610Phe double
mutant and the wild type proteins were analyzed concomitantly as reference points for the fully repressed
(0%) and active (100%) states, respectively. The EphA4 proteins expressed in bacteria
were tested for their ability to induce protein tyrosine phosphorylation in vivo (Figure 5a), and to
autophosphorylate and to phosphorylate enolase in vitro (Figure 5b). EphA4 proteins were also tested for
their ability to phosphorylate a peptide substrate using a continuous spectrophotometric assay (Figure 5c).
Lastly, full-length EphB2 proteins expressed in COS-1 cells were tested for their ability to
autophosphorylate in vivo and to autophosphorylate and phosphorylate enolase in vitro (Figure 5d).
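
The use of the double mutant and the wild type as 0% and 100% reference points can be sketched as a simple rescaling. This is a minimal illustration, assuming a linear rescaling between the two references; the function name and the raw rate values are hypothetical and are not data from the assays described here.

```python
def percent_of_wild_type(rate_mutant, rate_repressed, rate_wild_type):
    """Rescale a raw kinase rate so that the repressed reference maps to 0%
    and the wild-type reference maps to 100% (assumed linear rescaling)."""
    span = rate_wild_type - rate_repressed
    if span <= 0:
        raise ValueError("wild-type rate must exceed the repressed reference rate")
    return 100.0 * (rate_mutant - rate_repressed) / span


# Hypothetical raw rates in arbitrary units (not measured values).
repressed, wild_type = 2.0, 12.0
for name, rate in [("mutant A", 7.0), ("mutant B", 15.0)]:
    value = percent_of_wild_type(rate, repressed, wild_type)
    print(f"{name}: {value:.0f}% of wild type")
```

On this scale a mutant can exceed 100%, which is how values above wild-type activity would arise under the stated assumption.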

The two partial N-terminal juxtamembrane deletions, when introduced into the EphA4 construct,
significantly increased kinase activity in all four assays, restoring catalytic function as measured by the
spectrophotometric assay to 136% and 216% of wild-type activity for the ΔJX1 and ΔJX1+2
deletions, respectively. A similar effect was observed for the ΔJX1+2 deletion introduced into full-length
EphB2.

Mutation of Phe 608 in EphA4, which locates to helix αA', gave very weak restoration of
catalytic function. This result is consistent with the variability of position 608 amongst the Eph receptor
family members (42% identity). In contrast, mutation in both EphA4 and EphB2 constructs of the highly
conserved Pro 607 (95% identity), which initiates helix αA', to Gly greatly enhanced catalytic function in
all four assays, quantitated at 122% of wild-type activity by the spectrophotometric assay. This result is
consistent with a role for Pro 607, suggested by the crystal structure, in stabilizing helix αA' by imposing
conformational rigidity, or in promoting the association of juxtamembrane and N-terminal kinase lobe
elements through hydrophobic interactions. Similarly, mutation of the highly conserved Phe 620 (95%
identity) at the terminus of helix αB' to Asp also restored catalytic function in the four assays tested. Phe
620 is notable because it contributes to the hydrophobic pocket into which the phosphoregulatory residue
Tyr/Phe 610 binds; its substitution with Asp is predicted to disrupt the hydrophobic interaction with
Tyr/Phe 610, and to clash electrostatically with the surrounding negatively charged groups in a manner
mimicking phosphorylation of Tyr/Phe 610.
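
Percent identity at a single position, as quoted above for residues 607, 608 and 620, can be computed from one column of a multiple sequence alignment. The sketch below assumes identity is measured against a chosen reference residue; the exact definition used for the quoted figures is not stated here, and the toy column is hypothetical rather than taken from real Eph sequences.

```python
def percent_identity(column, reference):
    """Percentage of aligned residues in `column` that match `reference`."""
    if not column:
        raise ValueError("alignment column is empty")
    matches = sum(1 for residue in column if residue == reference)
    return 100.0 * matches / len(column)


# Hypothetical alignment column, one letter per family member:
# nineteen prolines and one alanine.
toy_column = list("P" * 19 + "A")
print(f"{percent_identity(toy_column, 'P'):.0f}% identity")
```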

The introduction of point mutations into the kinase domain at the interface with the
juxtamembrane region also restored catalytic function. Mutation of Ser680 (82% identity) to Trp in both
EphA4 and EphB2 constructs gave modest restoration, with the phosphorylation of peptide substrate being
restored to 41% of wild-type activity. Mutation of the absolutely conserved Gln684 (100% identity) to Trp
in EphB2 resulted in a greater increase in kinase activity, as did the double mutation
Ser680Trp/Gln684Trp. Both mutations map to helix αC and are predicted to sterically perturb the association
of the juxtamembrane region with the N-terminal catalytic lobe.

Robust restoration of activity was also observed for the EphA4 and EphB2 mutants ΔJXall,
Tyr604/610Glu, and ΔJX1+2 plus Phe620Asp, although the relative restoration as measured by the various
assays differed to a small degree. The restoration of activity by the ΔJXall mutant confirms that the
juxtamembrane segment is not absolutely required for kinase function, while the restoration by the
Tyr604/610Glu mutation suggests that the addition of negative charges at positions 604 and 610 is an
important component of juxtamembrane destabilization and the relief of autoinhibition. Lastly, the finding
that none of the EphB2 mutants are as active as the wild-type enzyme may indicate that these mutants have
perturbed some aspect of the oligomerization event that is needed for maximal activation of the full-length
receptor.

Overall, the mutagenesis results support a model for the regulation of receptor catalytic function
by the juxtamembrane segment, shown in Figure 6. Strand Ex1 and helix αA' of the juxtamembrane
segment contribute to the inhibitory effect on the catalytic domain, and the release of these elements from
their association with the catalytic domain is a requirement for catalytic activation. Physiologically, this
would be accomplished by phosphorylation at the Tyr 604 and 610 regulatory sites and potentially at
additional sites. The strong conservation of residues involved in the inhibitory interaction suggests that
this regulatory mechanism is conserved for all Eph receptor family members.

Comparison of Autoinhibitory Mechanisms of EphB2 and TGFβR1 Receptor Kinase

Analysis of the TGFβR1 serine/threonine kinase has revealed a role for the juxtamembrane
Gly/Ser/Thr-rich motif ("GS segment") in regulating catalytic activity. As with Eph receptor tyrosine
kinases, TGFβR1 kinases require phosphorylation at sites within the juxtamembrane segment for
subsequent phosphorylation of target Smad proteins (Macias-Silva et al., 1996). The regulatory mechanism
revealed by the X-ray crystal structure of a cytoplasmic fragment of TGFβR1 in complex with FKBP12
(Huse et al., 1999) shows some parallels to EphB2. In both structures, the intramolecular engagement of the
juxtamembrane segment induces conformational distortions in the catalytic domain that impinge on kinase
function. In addition, the induced distortions impact on the relative positioning and/or conformation of
helix αC. Beyond these similarities, however, the inhibitory mechanisms, including the mode of
juxtamembrane association with the catalytic domain and the resulting basis for inhibition, diverge.
Perhaps the most significant difference relates to the potential involvement of FKBP12 in stabilizing the
inhibited structure of TGFβR1, whereas EphB2 achieves an autoinhibited state independently.
Nonetheless, the data for EphB2 indicate that receptor tyrosine kinases and receptor serine/threonine
kinases have in some cases converged on a related regulatory mechanism in which the juxtamembrane
region inhibits the kinase domain in the inactive state, and is potentially liberated to interact with
downstream targets upon autophosphorylation.
Discussion 

Why does EphB2 employ a rather complex mechanism of autoregulation, involving the
non-catalytic juxtamembrane region? One possible benefit may be to block any potential signaling activity
intrinsic to the juxtamembrane sequence. In particular, phosphorylation of tyrosines 604 and 610 in EphB2
creates docking sites for SH2 domain proteins. Sequestering these tyrosines decreases their chance of
becoming adventitiously phosphorylated and thereby inappropriately transmitting a signal through the
recruitment of downstream targets. The coordination of kinase activation with the release of binding sites
for targets is reminiscent of Src family cytoplasmic tyrosine kinases, in which the SH2 and SH3 domains
engage internal ligands in a fashion that both inhibits the activity of the kinase domain and hinders
interactions of the SH2 and SH3 domains with other binding partners (Sicheri et al., 1997; Xu et al., 1997).

The involvement of the juxtamembrane sequence in autoregulation of EphB2 activity may also set
a phosphorylation threshold that must be exceeded to induce receptor activation. Full stimulation of Eph
receptors apparently requires autophosphorylation at multiple sites within both the activation segment and
juxtamembrane region. The use of at least two distinct phosphoregulatory steps may preclude
inappropriate Eph receptor activation resulting from basal levels of kinase activity. Since Eph receptors
have powerful biological activities during embryogenesis and postnatally, their aberrant activation would
be expected to have severe phenotypic consequences, which could be avoided by requiring multi-site
phosphorylation of the receptor.
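
The protective effect of requiring several phosphorylation events can be illustrated with a simple probability estimate. The sketch assumes, purely for illustration, that each regulatory site is phosphorylated independently with the same basal probability; neither the independence assumption nor the numbers come from the experiments described here, and the function name is hypothetical.

```python
def prob_all_sites(p_site, n_sites):
    """Probability that all n_sites are phosphorylated at once,
    assuming each site is modified independently with probability p_site."""
    if not 0.0 <= p_site <= 1.0:
        raise ValueError("p_site must lie between 0 and 1")
    if n_sites < 1:
        raise ValueError("n_sites must be at least 1")
    return p_site ** n_sites


basal = 0.05  # hypothetical basal phosphorylation probability per site
for n in (1, 2, 3):
    print(f"{n} site(s) required: {prob_all_sites(basal, n):.6f}")
```

Under these assumptions, each additional required site multiplies the chance of spurious activation by the per-site probability, so a low basal level is suppressed sharply.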

Are the Eph receptors unique among RTKs in employing cytoplasmic elements outside the
catalytic domain to regulate kinase activity? A variety of data obtained for the platelet-derived growth
factor β receptor (PDGFR), the closely related colony stimulating factor-1 receptor (c-Fms), stem cell
factor receptor (Kit), and the Flt3 receptor raise the possibility that this may in fact be a more widespread
phenomenon. Biochemical analysis and mutagenesis of the PDGFR-β have suggested that
autophosphorylation of juxtamembrane tyrosines 579 and 581 is required for stimulation of receptor kinase
activity by PDGF, potentially by allowing subsequent phosphorylation of tyrosine 857 in the activation
segment (Baxter et al., 1998). Conversion of these juxtamembrane tyrosines to phenylalanine inhibits
receptor activation, while their phosphorylation creates a binding site for the Src SH2 domain, resulting in
Src recruitment to the receptor. Thus, autophosphorylation within the juxtamembrane region of the
PDGFR-β may couple receptor activation to the exposure of SH2 domain-binding sites, as appears to be
the case for Eph receptors. Consistent with the notion that the juxtamembrane region of the PDGFR-β
exerts an inhibitory influence on kinase activity, substitution of a valine residue, just N-terminal to the
regulatory tyrosines, results in constitutive receptor activation in vitro and in vivo (Irusta and DiMaio,
1998). In addition to the PDGFR-β, the juxtamembrane regions of c-Fms (Myles et al., 1994), Kit, and Flt3
receptors have been implicated in regulation of tyrosine kinase activity. Oncogenic variants of Kit
identified in human and murine mast cell leukemias carry either amino acid substitutions or deletions in
the juxtamembrane region, which result in constitutive activation of the kinase domain (Tsujimura et al.,
1996) (see Figure 1). Remarkably, a majority of human gastrointestinal stromal tumors (GIST) have
activating Kit mutations that introduce substitutions or deletions into a short segment of the
juxtamembrane region, and are strongly implicated in the etiology of these tumors (Hirota et al., 1998;
Nakahara et al., 1998; Anderson, 1998). Furthermore, approximately 20% of acute myeloid leukemias have
internal tandem duplications of Flt3 that create in-frame insertions of variable length in the juxtamembrane
region, leading to ligand-independent kinase activity and oncogenic activation (Nakao et al., 1996; Yokota
et al., 1997; Hayakawa et al., 2000). Thus, Kit and Flt3 juxtamembrane regions may repress kinase activity,
and juxtamembrane mutations that relieve this inhibition can result in human cancers.

A similar situation may pertain for the insulin receptor, which upon activation becomes
autophosphorylated within the juxtamembrane region and consequently binds targets such as IRS-1 and
ShcA, which possess PTB domains. Kinetic analysis of wild type and mutant insulin receptors has
suggested that the insulin receptor juxtamembrane region acts as an intrasteric inhibitor to block the kinase
domain active site, in a fashion that is relieved by autophosphorylation of juxtamembrane tyrosines (Cann
et al., 2000).

Many RTKs have C-terminal tails that upon activation become phosphorylated at SH2/PTB
domain-binding sites. Structural analysis of the Tie2/Tek receptor cytoplasmic region has indicated that in
the inactive state the tail interacts with the kinase domain in a way that partially occludes the C-terminal
tyrosines and the peptide binding site (Shewchuk et al., 2000). This raises the possibility that
autophosphorylation of the Tie2 tail causes a conformational change that exposes both the C-terminal
phosphotyrosine sites and the substrate binding site of the kinase domain.

Thus the juxtamembrane and C-terminal segments of RTKs may play a pivotal role in regulating 
the kinase domain, and in coordinating enzymatic activation with the exposure of motifs that bind 
cytoplasmic targets. 

In addition to revealing an unexpected level of complexity in the regulation of RTKs, these
observations have interesting implications for the design of RTK inhibitors. The structure of the Abl kinase
bound to the inhibitor STI-571 suggests that this compound binds selectively to the inactive form of the
kinase (Schindler et al., 2000). The unusual structure of autoinhibited EphB2 suggests the possibility of
isolating inhibitors that bind specifically to the inactive conformation of the kinase. Indeed, if this mode of
intrasteric regulation is a more common feature of RTKs, this might be a general strategy for the
identification of selective RTK inhibitors.

The structure of EphB2 reveals an entirely novel mechanism for RTK autoregulation. 

All publications mentioned in the above specification are herein incorporated by reference. 

Various modifications and variations of the described methods and system of the invention will be
apparent to those skilled in the art without departing from the scope and spirit of the invention. Although
the invention has been described in connection with specific preferred embodiments, it should be
understood that the invention as claimed should not be unduly limited to such specific embodiments.
Indeed, various modifications of the described modes for carrying out the invention which are obvious to
those skilled in chemistry, biology or related fields are intended to be within the scope of the following
claims.

Table 2. Intermolecular contacts of the Juxtamembrane Region and Kinase Domain of an Eph Receptor



No. of       Juxtamembrane Region   Kinase Domain/Juxtamembrane   Distance Between      Atomic Interaction
Atomic       Atomic Contact         Region Atomic Contact         Atomic Contacts (Å)   Property
Interaction

1            Phe/Tyr 604 CB         Met 748 CE                    4.58                  Hydrophobic
2            Phe/Tyr 604 N          Met 748 O                     2.83                  Hydrogen bond
3            Phe/Tyr 604 CD2        Tyr 750 CD1                   3.78                  Hydrophobic
4            Phe/Tyr 604 CE2        Tyr 750 CE1                   4.12                  Hydrophobic
5            Phe/Tyr 604 CD1        Phe 685 CE2                   4.06                  Hydrophobic
6            Ile 605 N              Gln 684 OE1                   2.83                  Hydrogen bond
7            Ile 605 O              Gln 684 NE2                   3.00                  Hydrogen bond
8            Phe/Tyr 604 CE1        Gln 684 CD                    4.11                  van der Waals
9            Pro 607 CD             Ile 681 CG1                   3.85                  Hydrophobic
10           Pro 607 CB             Ser 680 OG                    3.16                  van der Waals
11           Phe 608 CZ             Asp 674 OD1                   3.27                  van der Waals
12           Phe 608 CZ             Ser 677 CB                    4.35                  van der Waals
13           Phe 608 CE2            Arg 673 CG                    3.79                  Hydrophobic
14           Pro 613 CB             Arg 673 CD                    3.60                  Hydrophobic
15           Asn 614 OD1            Arg 672 NH1                   2.87                  Hydrogen bond
16           Val 617 CG2            Leu 676 CD1                   4.69                  Hydrophobic
17           Val 617 CG2            Ser 680 CB                    4.15                  Hydrophobic
18           Val 617 CG2            Leu 676 CB                    4.10                  Hydrophobic
19           Ala 621 CB             Leu 693 CD2                   3.98                  Hydrophobic
20           Phe 620 CD1            Gln 684 CG                    3.60                  Hydrophobic
21           Phe 620 CE1            Gln 684 CD                    3.77                  Hydrophobic
22           Phe 620 CB             Gln 683 O                     4.15                  van der Waals
23           Phe 620 O              Gln 683 O                     3.41                  Hydrogen bond
24           Ala 616 CA             Ser 680 CB                    3.8                   Hydrophobic
25           Tyr/Phe 604 CE2        Asp 606 CB                    4.26                  Hydrophobic
26           Pro 607 CD             Asp 606 OD1                   3.28                  van der Waals
27           Asp 606 O              Thr 609 OG1                   2.72                  Hydrogen bond
28           Asp 606 O              Thr 609 N                     2.90                  Hydrogen bond
29           Asp 606 CB             Thr 609 CG2                   3.56                  Hydrophobic
30           Phe 604 CZ             Pro 607 CD                    3.90                  Hydrophobic
31           Pro 607 O              Phe 610 N                     3.08                  Hydrogen bond
32           Phe 608 CD2            Pro 613 CG                    4.43                  Hydrophobic
33           Phe 610 CE1            Ile 605 CG2                   3.48                  Hydrophobic
34           Phe 610 CZ             Phe 620 CE1                   3.91                  Hydrophobic
35           Phe 610 CE1            Ala 616 CB                    3.91                  Hydrophobic
36           Phe 610 CD2            Glu 615 CG                    3.74                  Hydrophobic
37           [missing]              Glu 615 N                     2.73                  Hydrogen bond
38           Phe 608 N              [missing]                     2.83                  Hydrogen bond
39           [missing]              [missing]                     3.09                  Hydrogen bond
40           [missing]              [missing]                     3.11                  Hydrogen bond
41           Asn 614 O              Arg 618 N                     3.16                  Hydrogen bond
42           Pro 613 O              Val 617 N                     3.59                  Weak hydrogen bond, van der Waals
43           Glu 615 O              Glu 619 N                     3.06                  Hydrogen bond
44           Glu 615 OE1            Glu 619 OE1                   2.64                  Hydrogen bond
45           Phe 620 N              Ala 616 O                     2.96                  Hydrogen bond
46           Glu 619 OE2            Phe 620 CZ                    4.03                  van der Waals
47           Ala 621 N              Val 617 O                     2.91                  Hydrogen bond
48           Ala 621 O              Val 617 O                     3.25                  Hydrogen bond
49           Ala 621 CB             Val 617 CG1                   4.16                  Hydrophobic

refinement resolution: 30 - 1.9 Å
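
Distances such as those in Table 2 are Euclidean distances between atom centres in a coordinate listing. The sketch below parses two ATOM records and computes their separation. It assumes whitespace-separated fields in the order serial, atom name, residue name, chain, residue number, x, y, z; the helper names are hypothetical. The two records are the carbonyl carbon of His 864 and the amide nitrogen of Gln 865 from the coordinate listing in this document, chosen only because both are fully legible; they are not one of the tabulated contacts.

```python
import math


def parse_atom(record):
    """Parse a whitespace-separated ATOM record into (label, (x, y, z))."""
    fields = record.split()
    if fields[0] != "ATOM":
        raise ValueError("not an ATOM record")
    name, res_name, res_num = fields[2], fields[3], fields[5]
    xyz = tuple(float(value) for value in fields[6:9])
    return f"{res_name} {res_num} {name}", xyz


def distance(a, b):
    """Euclidean distance between two (x, y, z) points, in the input units."""
    return math.dist(a, b)


records = [
    "ATOM 1894 C HIS A 864 12.988 15.619 1.983 1.00 19.62 A",
    "ATOM 1896 N GLN A 865 13.641 16.143 0.952 1.00 19.53 A",
]
(label_a, xyz_a), (label_b, xyz_b) = (parse_atom(r) for r in records)
print(f"{label_a} to {label_b}: {distance(xyz_a, xyz_b):.2f} Å")
```

The two atoms are joined by a peptide bond, so a separation close to the usual peptide C-N bond length is a quick check that the fields were read in the right order.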


ATOM 


1885 


O 


LEU A 


863 


12.595 


13.005 


0 . 641 


1 . 00 


20, 


.62 


A 


ATOM 


1886 


N 


HIS A 


864 


14.434 


13. 683 


1 .741 


1 . 00 


19.85 


A 

t\ 


ATOM 


1887 


CA 


HIS A 


864 


13.643 


14 . 448 


2 .686 


1.00 


18, 


.66 


A 


ATOM 


1888 


CB 


HIS A 


864 


14.496 


14. 954 


3.847 


1.00 


19, 


,81 


A 


ATOM 


1889 


CG 


HIS A 


864 


13.700 


15 . 626 


4 . 922 


1 . 00 


19, 


.98 


n 
t\ 


ATOM 


1890 


CD2 


HIS A 


864 


12.826 


15. 127 


5.828 


1 . 00 


20. 


.76 


j± 


ATOM 


1891 


NDl 


HIS A 


864 


13.735 


16. 988 


5.133 


1.00 


21. 


.33 




ATOM 


1892 


CE1 


HIS A 


864 


12.919 


17 .299 


6.125 


1 . 00 


22. 


.47 




ATOM 


1893 


NE2 


HIS A 


864 


12.355 


16.187 


6 .564 


1.00 


22, 


.76 


A 


ATOM 


1894 


C 


HIS A 


864 


12.988 


15. 619 


1. 983 


1.00 


19. 


.62 


A 


ATOM 


1895 


O 


HIS A 


864 


11.897 


16. 052 


2.366 


1.00 


19. 


,61 


A 


ATOM 


1896 


N 


GLN A 865 


13.641 


16.143 


0. 952 


1.00 


19. 


,53 


A 


ATOM 


1897 


CA 


GLN A 


865 


13.039 


17.259 


0.238 


1.00 


19. 


79 


A 


ATOM 


1898 


CB 


GLN A 


865 


13.977 


17 .808 


-0.837 


1.00 


19, 


,80 


A 


ATOM 


1899 


CG 


GLN A 


865 


13.379 


18 . 989 


-1.567 


1 . 00 


21. 


23 


A 


ATOM 


1900 


CD 


GLN A 


865 


13.013 


20. 114 


-0 . 618 


1.00 


22. 


98 


7\ 


ATOM 


1901 


OE1 


GLN A 


865 


13.868 


20. 627 


0 . 105 


1 . 00 


26. 


38 


iT. 


ATOM 


1902 


NE2 


GLN A 


865 


11.739 


20. 504 


-0. 613 


1 . 00 


21. 


44 


A 


ATOM 


1903 


C 


GLN A 


865 


11. 766 


16. 750 


-0.416 


1 . 00 


19. 


00 


7\ 

n. 


ATOM 


1904 


O 


GLN A 


865 


10.735 


17 . 419 


-0 .396 


1 . 00 


19. 


67 


7\ 
■t\ 


ATOM 


1905 


N 


LEU A 


866 


11. 841 


15. 559 


-0 . 997 


1.00 


19. 


26 


7\ 
t\ 


ATOM 


1906 


CA 


LEU A 


866 


10. 663 


14 . 988 


-1 . 632 


1 . 00 


18. 


45 


H. 


ATOM 


1907 


CB 


LEU A 


866 


11.014 


13 . 649 


-2 .283 


1 . 00 


18. 


07 




ATOM 


1908 


CG 


LEU A 


866 


9.887 


12 . 930 


-3 . 021 


1 . 00 


19. 


78 


7\ 
r\ 


ATOM 


1909 


CD1 


LEU A 


866 


9.168 


13. 903 


-3 . 967 


1 . 00 


18. 


91 




ATOM 


1910 


CD2 


LEU A 


866 


10.467 


11.754 


-3 .775 


1.00 


20. 


97 




ATOM 


1911 


C 


LEU A 


866 


9.546 


14 . 824 


-0 .589 


1 .00 


18. 


47 




ATOM 


1912 


O 


LEU A 


866 


8.367 


15.026 


-0 . 890 


1.00 


17. 


51 


A 


ATOM 


1913 


N 


MET A 


867 


9.921 


14 . 476 


0. 641 


1.00 


18. 


09 


j± 


ATOM 


1914 


CA 


MET A 


867 


8.946 


14 . 318 


1 .711 


1 . 00 


18. 


01 


A 

ri. 


ATOM 


1915 


CB 


MET A 


867 


9.632 


13 .865 


3 . 005 


1 . 00 


19. 


66 


7\ 
rl 


ATOM 


1916 


CG 


MET A 


867 


10. 077 


12. 419 


2 . 991 


1 . 00 


20. 


12 


A 


ATOM 


1917 


SD 


MET A 


867 


10. 968 


11. 976 


4 . 495 


1 . 00 


22. 


40 


r\ 


ATOM 


1918 


CE 


MET A 


867 


12.235 


10. 954 


3 . 814 


1 . 00 


22. 


54 


J± 


ATOM 


1919 


C 


MET A 


867 


8.228 


15. 640 


1 . 965 


1.00 


18. 


98 


A 


ATOM 


1920 


O 


MET A 


867 


6.996 


15. 686 


2 . 058 


1 . 00 


17. 


83 


^ 


ATOM 


1921 


N 


LEU A 


868 


9. 007 


16.712 


2 . 089 


1 on 


18. 


92 




ATOM 


1922 


CA 


LEU A 


868 


8.447 


18. 042 


2 . 326 




20. 


03 


n 


ATOM 


1923 


CB 


LEU A 


868 


9.566 


19. 083 


2 .455 


i on 


21. 


62 


7\ 
r\ 


ATOM 


1924 


CG 


LEU A 


868 


10.538 


18.913 


3.628 


1.00 


22.08 


A 


ATOM 


1925 


CD1 


LEU A 


868 


11.607 


20.012 


3.578 


1.00 


21. 


68 


A 


ATOM 


1926 


CD2 


LEU A 868 


9.772 


18.989 


4.942 


1.00 


18. 


97 


A 


ATOM 


1927 


C 


LEU A 868 


7.506 


18.444 


1.191 


1.00 


19.33 


A 


ATOM 


1928 


O 


LEU A 868 


6.501 


19.111 


1.420 


1.00 


20. 


63 


A 


ATOM 


1929 


N 


ASP A 


8 69 


7.836 


18.037 


-0.031 


1.00 


17. 


79 


A 


ATOM 


1930 


CA 


ASP A 


869 


7.001 


18.360 


-1.183 


1.00 


18. 


56 


A 


ATOM 


1931 


CB 


ASP A 869 


7.707 


17.971 


-2.484 


1.00 


17. 


65 


A 


ATOM 


1932 


CG 


ASP A 


869 


8.988 


18.762 


-2.703 


1.00 


18. 


80 


A 


ATOM 


1933 


OD1 


ASP A 869 


9.175 


19.790 


-2.021 


1.00 


16.81 


A 


ATOM 


1934 


OD2 


ASP A 


869 


9.799 


18.364 


-3.557 


1.00 


18. 


79 


A 



WO 02/061055 



PCT/CA02/00114 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATOM 


1935 


C 


ASP A 


869 


94 
5.664 


17.651 


-1.072 


1.00 


20. 40 


a 


ATOM 


1936 


O 


ASP A 


869 


4.631 


18.197 


-1.460 


1.00 


21.71 


A 


ATOM 


1937 


N 


CYS 


A 


870 


5.682 


16.439 


-0.524 


1 .00 


18.78 


a^ 


ATOM 


1938 


CA 


CYS 


A 


870 


4.456 


15. 670 


-0.354 


1.00 


19.36 


n 


ATOM 


1939 


CB 


CYS 


A 


870 


4.769 


14.201 


-0.039 


1 . 00 


15. 21 




ATOM 


1940 


SG 


CYS 


A 


870 


5.477 


13. 278 


-1.422 


1.00 


17 . 93 




ATOM 


1941 


C 


CYS A 


870 


3. 616 


16.256 


0.769 


1 . 00 


18. 15 


A 


ATOM 


1942 


O 


CYS 


A 


870 


2.390 


16.090 


0.785 


1.00 


20. 18 


A 


ATOM 


1943 


N 


TRP 


A 


871 


4 .259 


16. 956 


1 .702 


1 .00 


19. 04 


a 


ATOM 


1944 


CA 


TRP A 


871 


3.529 


17 . 534 


2.833 


1 .00 


20 40 


a 


ATOM 


1945 


CB 


TRP 


A 


871 


4.306 


17 . 335 


4 . 151 


1 . 00 


20 ,01 


7V 

M. 


ATOM 


1946 


CG 


TRP 


A 


871 


4 . 663 


15. 890 


4 . 480 


1 . 00 


21 . 12 




ATOM 


1947 


CD2 


TRP 


A 


871 


5. 874 


15. 424 


5.095 


1 . 00 


19 . 64 




ATOM 


1948 


CE2 


TRP 


A 


871 


5.778 


14. 014 


5.204 


1 . 00 


18 . 64 




ATOM 


1949 


CE3 


TRP 


A 


871 


7 . 029 


16. 060 


5.563 


1 . 00 


IS S4 

XO . OH 




ATOM 


1950 


CD1 


TRP 


A 


871 


3. 897 


14.770 


4 . 254 


i on 

X • UU 


21 2 fi 




ATOM 


1951 


NE1 


TRP 


A 


871 


4 .567 


13. 642 


4 . 685 


1 . 00 


19 40 


a 


ATOM 


1952 


CZ2 


TRP 


A 


871 


6. 797 


13. 232 


5 . 762 


1 . 00 


1ft 7fi 
X O . I o 


7\ 


ATOM 


1953 


CZ3 


TRP 


A 


871 


8. 045 


15.282 


6. 120 


i on 

X . u u 


1Q CO 

X ZJ . D J 


7\ 

A 


ATOM 


1954 


CH2 


TRP 


A 


871 


7 . 920 


13 . 878 


6. 214 


l nn 

X . uu 


1 Q Q7 




ATOM 


1955 


C 


TRP 


A 


871 


3. 168 


19 . 012 


2 . 669 


i on 

X . uu 


20 ftfl 


ii 


ATOM 


1956 


O 


TRP 


A 


871 


2 . 999 


19. 736 


3 . 654 


1 nn 

X . UU 


21 nfi 


a 


ATOM 


1957 


N 


GLN 


A 


872 


3 . 055 


19. 459 


1 ♦ 424 


i nn 

X ■ UU 


22 7 A 

&£. . / H. 


A 


ATOM 


1958 


CA 


GLN 


A 


872 


2 . 680 


20.846 


1 . 157 


1 oo 


2^ 2 ^ 


A 


ATOM 


1959 


CB 


GLN 


A 


872 


2 .768 


21 . 158 


-0 . 337 


i nn 

X • uu 


21 Ql 


A 


ATOM 


1960 


CG 


GLN 


A 


872 


4 . 174 


21 . 383 


— o P1 


i on 
x . uu 


2 Oj 01 
£. z> . Z / 


A 


ATOM 


1961 


CD 


GLN 


A 


872 


4 . 857 


22 . 495 


-fi 047 


1 oo 
x . uu 


12 A ^ 


A 


ATOM 


1962 


OE1 


GLN 


A 


872 


4 . 377 


23 . 628 


-0 . 015 


i on 

X . uu 


14 77 
OH .11 


TV 

A 


ATOM 


1963 


NE2 


GLN 


A 


872 


5. 979 


22 . 178 


0 . 579 


1 oo 
X . uu 


1 R 2 Q 


A 


ATOM 


1964 


c 


GLN 


A 


872 


1 . 260 


21 . 073 


1 . 639 


1 on 

X ■ uu 




A 

A 


ATOM 


1965 


O 


GLN 


A 


872 


0.351 


20 . 310 


1 . 301 


i nn 

X - uu 


2fi R R 


A 


ATOM 


1966 


N 


LYS 


A 


873 


1. 081 


22 . 112 


2.445 


i nn 

X ■ uu 


2fi 47 

£. O . ft / 


A 


ATOM 


1967 


CA 


LYS 


A 


873 


-0 . 224 


22 . 463 


2 . 990 


1 nn 

X . uu 


27 Q 


a 
A 


ATOM 


1968 


CB 


LYS 


A 


873 


-0 . 139 


23.849 


O • OO zt 


1 oo 
X . uu 


2fl 61 

6 0 . OX 


A 


ATOM 


1969 


CG 


LYS 


A 


873 


-1 .390 


24 . 304 


4 . 360 


1 oo 
X • uu 


2 Q RJ 


A 


ATOM 


1970 


CD 


LYS 


A 


873 


-1 . 188 


25. 685 


4 . 968 


o on 
u . uu 


in An 

3 U . ft U 


a 
A 


ATOM 


1971 


CE 


LYS 


A 


873 


-2 . 412 


26 . 130 


5 . 752 


n nn 
u • uu 


in on 


a 
A 


ATOM 


1972 


NZ 


LYS 


A 


873 


-2 . 223 


27 . 477 


6. 360 


n nn 

u • UU 


11 91 
OX . c. J 


a 
A 


ATOM 


1973 


C 


LYS 


A 


873 


-1.281 


22 . 446 


1 . 886 


i nn 

X . UU 


2Q 41 


a 
A 


ATOM 


1974 


O 


LYS 


A 


873 


-2.320 


21 .800 


2 . 022 


i nn 

X . UU 


2ft ni 


A 


ATOM 


1975 


N 


ASP 


A 


874 


-1 . 008 


23 . 156 


O 7Q1 

V . f so 


i nn 
x • uu 


lO A 1 

JU . ft 1 


a 
A 


ATOM 


1976 


CA 


ASP 


A 


874 


-1 . 930 


23 . 211 


-0 . 340 


1 oo 
X • uu 


19 1 1 
Od. . XO 


a 
A 


ATOM 


1977 


CB 


ASP 


A 


874 


-1 .585 


24 . 399 


-1 . 247 


1 oo 
X . uu 


14 OT 
OH ■ U / 


A 


ATOM 


1978 


CG 


ASP 


A 


874 


-2 . 640 


24 . 656 


-2 . 304 


1 oo 
X » u u 


Ifi 99 


a 
A 


ATOM 


1979 


OD1 


ASP 


A 


874 


-3 . 160 


23 . 678 


-2 . 887 


1 oo 
X . u u 


16 11 
JO. JJ 


A 


ATOM 


1980 


OD2 


ASP 


A 


874 


-2 . 944 


25.844 


-2 . 560 


1 oo 
X . uu 


1Q QC 


A 


ATOM 


1981 


c 


ASP 


A 


874 


-1 . 806 


21 ftlQ 


-1.14 6 


1 oo 
X . uu 


i o on 

J U » 71 


A 


ATOM 


1982 


O 


ASP A 


874 


-0 . 758 


21 . 653 


-1 727 


1 oo 
x . UU 


lO 1 1 


A 


ATOM 


1983 


N 


ARG 


A 


875 


-2.876 


21 114 


— 1 2D7 

X . £.\J 1 


1 oo 


lO 

JU . / O 


A 


ATOM 


1984 


CA 


ARG A 


875 


-2 817 

£. . O O i 


1 q A7n 


—1 Q1Q 

— X • Z) O Z7 


1 oo 
X . uu 


lO Ql 


A 


ATOM 


1985 


CB 


ARG A 


875 


-4 1 77 

1 . X / f 


1 Q 1 1ft 

X :7 * X OO 


-1 ft!9 
—x . OJ^ 


1 oo 
X . uu 


lo oo 
Jz . 


A 


ATOM ' 


1986 


CG 


ARG A 


875 


-5 ^16 


1 ft 7SS 


— 2 fim 


1 oo 
X . uu 


Jj . Uo 


A 


ATOM 


1987 


CD 


ARG A 


875 


-fi 411 

\J * H J.O 


1 ft 722 


-2 R4 A 


i on 
X . uu 




A 


ATOM   1988  NE  ARG A 875      -7.555  19.234  -3.603  1.00 43.73      A
ATOM   1989  CZ  ARG A 875      -8.448  20.094  -3.125  1.00 45.20      A
ATOM   1990  NH1 ARG A 875      -8.334  20.544  -1.883  1.00 45.87      A
ATOM   1991  NH2 ARG A 875      -9.457  20.502  -3.885  1.00 47.30      A
ATOM   1992  C   ARG A 875      -2.477  20.028  -3.412  1.00 28.33      A
ATOM   1993  O   ARG A 875      -1.965  19.100  -4.033  1.00 26.80      A
ATOM   1994  N   ASN A 876      -2.756  21.199  -3.972  1.00 27.63      A
ATOM   1995  CA  ASN A 876      -2.463  21.457  -5.375  1.00 27.11      A
ATOM   1996  CB  ASN A 876      -3.223  22.703  -5.845  1.00 29.67      A
ATOM   1997  CG  ASN A 876      -4.663  22.396  -6.237  1.00 32.23      A
ATOM   1998  OD1 ASN A 876      -5.558  23.221  -6.060  1.00 36.30      A
ATOM   1999  ND2 ASN A 876      -4.887  21.212  -6.786  1.00 33.53      A
ATOM   2000  C   ASN A 876      u . y / c,  21.621  -5.632  1.00 25.93  A
ATOM   2001  O   ASN A 876      u .     21.396  -6.746  1.00 26.98      A
ATOM   2002  N   HIS A 877      Zoo     21.996  -4.592  1.00 24.62      A
ATOM   2003  CA  HIS A 877      1 .     22.203  -4.690  1.00 23.95      A
ATOM   2004  CB  HIS A 877      1 . bob 23.185  -3.605  1.00 25.91      A
ATOM   2005  CG  HIS A 877       1.103  24.566  -3.758  1.00 28.62      A
ATOM   2006  CD2 HIS A 877       0.390  25.134  -4.760  1.00 29.02      A
ATOM   2007  ND1 HIS A 877       1.268  25.548  -2.803  1.00 30.92      A
ATOM   2008  CE1 HIS A 877       0.682  26.659  -3.212  1.00 32.07      A
ATOM   2009  NE2 HIS A 877       0.142  26.434  -4.396  1.00 30.42      A
ATOM   2010  C   HIS A 877       1.987  20.893  -4.555  1.00 24.02      A
ATOM   2011  O   HIS A 877       3.169  20.821  -4.919  1.00 21.47      A
ATOM   2012  N   ARG A 878       1.342  19.860  -4.017  1.00 22.45      A
ATOM   2013  CA  ARG A 878       2.022  18.574  -3.858  1.00 20.77      A
ATOM   2014  CB  ARG A 878       1.196  17.618  -2.995  1.00 18.58      A
ATOM   2015  CG  ARG A 878       0.878  18.140  -1.609  1.00 17.53      A
ATOM   2016  CD  ARG A 878      -0.163  17.267  -0.922  1.00 17.61      A
ATOM   2017  NE  ARG A 878      -0.678  17.923   0.273  1.00 15.99      A
ATOM   2018  CZ  ARG A 878      -1.898  17.746   0.763  1.00 17.79      A
ATOM   2019  NH1 ARG A 878      -2.743  16.913   0.165  1.00 18.10      A
ATOM   2020  NH2 ARG A 878      -2.282  18.440   1.830  1.00 20.88      A
ATOM   2021  C   ARG A 878       2.264  17.922  -5.208  1.00 20.33      A
ATOM   2022  O   ARG A 878       1.473  18.058  -6.137  1.00 21.44      A
ATOM   2023  N   PRO A 879       3.377  17.200  -5.334  1.00 19.46      A
ATOM   2024  CD  PRO A 879       4.395  16.893  -4.307  1.00 20.47      A
ATOM   2025  CA  PRO A 879       3.687  16.530  -6.592  1.00 18.32      A
ATOM   2026  CB  PRO A 879       5.116  16.050  -6.374  1.00 19.77      A
ATOM   2027  CG  PRO A 879       5.106  15.696  -4.907  1.00 18.07      A
ATOM   2028  C   PRO A 879       2.727  15.362  -6.801  1.00 18.71      A
ATOM   2029  O   PRO A 879       2.135  14.846  -5.849  1.00 17.95      A
ATOM   2030  N
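Coordinate records of the kind listed above can be read programmatically to recover interatomic distances. The following is a minimal sketch only, not part of the disclosed method. It assumes each record carries, in order, the record type, atom serial number, atom name, residue name, chain, residue number, x, y, z, occupancy, temperature factor, and segment identifier, separated by whitespace; the column meanings and the names `Atom` and `parse_atom` are illustrative assumptions. The two sample records are the backbone carbonyl C and O atoms of His 877 (atoms 2010 and 2011).

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    """One whitespace-separated ATOM record (column meanings are assumed)."""
    serial: int
    name: str
    res_name: str
    chain: str
    res_seq: int
    xyz: tuple
    occupancy: float
    b_factor: float


def parse_atom(line: str) -> Atom:
    """Parse a record such as 'ATOM 2010 C HIS A 877 1.987 20.893 -4.555 1.00 24.02 A'."""
    fields = line.split()
    if len(fields) < 11 or fields[0] != "ATOM":
        raise ValueError(f"not a complete ATOM record: {line!r}")
    return Atom(
        serial=int(fields[1]),
        name=fields[2],
        res_name=fields[3],
        chain=fields[4],
        res_seq=int(fields[5]),
        xyz=(float(fields[6]), float(fields[7]), float(fields[8])),
        occupancy=float(fields[9]),
        b_factor=float(fields[10]),
    )


# Sample records: carbonyl C and O of His 877 (atoms 2010 and 2011).
SAMPLE = """\
ATOM 2010 C HIS A 877 1.987 20.893 -4.555 1.00 24.02 A
ATOM 2011 O HIS A 877 3.169 20.821 -4.919 1.00 21.47 A
"""

atoms = [parse_atom(line) for line in SAMPLE.splitlines()]

# Euclidean distance between the two atoms, in angstroms.
bond = math.dist(atoms[0].xyz, atoms[1].xyz)
print(f"{atoms[0].name}-{atoms[1].name} distance: {bond:.3f} A")
```

The computed separation is about 1.24 Å, in line with a carbonyl C=O bond. The same distance calculation, applied across residues with a chosen cutoff, is one simple way to enumerate candidate atomic contacts.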
CLAIMS

1. An isolated binding pocket of a receptor tyrosine kinase (RTK) that regulates the kinase domain of the
receptor tyrosine kinase. 

2. An isolated binding pocket as claimed in claim 1 wherein the RTK is an Eph receptor, preferably an
EphB2 receptor. 

3. Molecules or molecular complexes that comprise all or parts of either one or more of a binding pocket 
as claimed in claim 1 or 2, or a homolog of the binding pocket that has similar structure and shape. 

4. A crystal comprising a binding pocket of an RTK that regulates the kinase domain of the RTK.

5. A crystal as claimed in claim 4 wherein the binding pocket is in an autoinhibited state.

6. A crystal comprising a juxtamembrane region and/or kinase domain of an RTK, or part thereof. 

7. A crystal formed by a juxtamembrane region and a kinase domain of an RTK in an autoinhibited state. 

8. A crystal comprising a binding pocket of an RTK that regulates the kinase domain of the RTK, in 
association with a ligand. 

9. A crystal comprising a binding pocket of an RTK as claimed in claim 1 or 2 complexed or associated
with a ligand. 

10. A crystal as claimed in claim 9 wherein the ligand is a nucleotide or analogue thereof, a substrate or 
analogue thereof, a cofactor, and/or heavy metal atom. 

11. A crystal as claimed in claim 9 wherein the ligand is a modulator of the activity of an RTK.

12. A crystal as claimed in any of the preceding claims wherein the shape and structure of the binding 
pocket is defined by one or more atomic interactions or enzyme atomic contacts in Table 2. 

13. A crystal comprising a binding pocket of an Eph receptor. 

14. A crystal comprising a binding pocket of an Eph receptor and a nucleotide or analogue thereof, from 
which it is possible to derive structural data for the nucleotide. 

15. A crystal according to any preceding claim wherein the Eph receptor is derivable from a human cell.

16. A crystal according to any preceding claim, wherein the Eph receptor is EphB2.

17. A crystal according to any preceding claim wherein the crystal comprises a binding pocket of an Eph 
receptor having a mutation in the part of the enzyme which is involved in phosphorylation. 

18. A crystal according to any preceding claim wherein the crystal comprises a binding pocket of an Eph 
receptor having a mutation in one or more tyrosine residues.

19. A crystal according to any preceding claim wherein the binding pocket is in association with a 
cofactor. 

20. A crystal according to any preceding claim having the structural coordinates shown in Table 3. 

21. A model of a binding pocket of an RTK made using a crystal according to any preceding claim. 

22. A model of: (a) a binding pocket of an RTK that is involved in maintaining an autoinhibited state or
active state of an RTK or regulates the kinase domain of an RTK; and (b) a modification of the model
of (a).

23. A model of a binding pocket of the present invention that substantially represents the structural
coordinates specified in Table 3.

24. A computer-readable medium having stored thereon a crystal or model according to any of the 
preceding claims. 

25. A method of determining the secondary and/or tertiary structures of a polypeptide comprising the step
of using a crystal or model according to any of the preceding claims.

26. A method of screening for a ligand capable of associating with a binding pocket and/or inhibiting or
enhancing the atomic contacts of interactions in a binding pocket, comprising the use of a crystal or
model according to any of the preceding claims.

27. A ligand identified by a method according to claim 26.

28. A ligand identified by a method according to claim 26 that is a modulator capable of modulating the 
activity of the RTK. 

29. A method of identifying a modulator of an RTK comprising determining if a test agent inhibits or 
potentiates an autoinhibited state or active state of a kinase domain of the RTK. 

30. A method as claimed in claim 29 comprising one or more of the following additional steps:

(a) testing whether the modulator is a modulator of the activity of a RTK, preferably testing the 
activity of the modulator in cellular assays and animal model assays; 

(b) modifying the modulator; 

(c) optionally rerunning steps (a) or (b); and 

(d) preparing a pharmaceutical composition comprising the modulator.

31. A method of conducting a drug discovery business comprising: 

(a) providing one or more systems employing the atomic interactions, atomic contacts, or 
structural coordinates of a binding pocket of an RTK, for identifying agents by their ability 
to inhibit or potentiate the atomic interactions or atomic contacts of a binding pocket; and 

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, for
efficacy and toxicity in animals; and

(d) formulating a pharmaceutical preparation including one or more agents identified in step (b) 
as having an acceptable therapeutic profile. 

32. A method of conducting a drug discovery business comprising: 

(a) providing one or more systems for identifying agents by their ability to inhibit or potentiate
an autoinhibited state or active state of a kinase domain of an RTK; and

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, for 
efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step (b)
as having an acceptable therapeutic profile.

33. A method of conducting a target discovery business comprising: 

(a) providing one or more systems employing the atomic interactions, atomic contacts, or
structural coordinates of a binding pocket of an RTK, for identifying agents by their ability to
inhibit or potentiate the atomic interactions or atomic contacts, or providing one or more
systems for identifying agents by their ability to inhibit or potentiate an autoinhibited state or
active state of a kinase domain of an RTK;

(b) optionally conducting therapeutic profiling of agents identified in step (a) for efficacy and
toxicity in animals; and

(c) licensing, to a third party, the rights for further drug development and/or sales for agents 
identified in step (a), or analogs thereof. 

34. A method for regulating the kinase domain of an RTK by changing a binding domain or pocket of an
RTK that regulates the kinase domain, from an autoinhibited state to an active state or from an active
state to an autoinhibited state.

35. A method for inhibiting kinase activity of an RTK comprising maintaining the RTK or a binding
pocket thereof involved in regulating the kinase domain in an autoinhibited state, or potentiating an 
autoinhibited state for the RTK or binding pocket thereof involved in regulating the kinase domain. 

36. Use of a modulator according to any preceding claim in the manufacture of a medicament to treat
and/or prevent a disease in a mammalian patient.

37. A pharmaceutical composition comprising a ligand or modulator according to any preceding claim, 
and optionally a pharmaceutically acceptable carrier, diluent, excipient or adjuvant or any combination 
thereof. 

38. A method of treating and/or preventing a disease comprising administering a ligand, modulator, or
pharmaceutical composition according to any preceding claim to a mammalian patient.

39. A method of treating or preventing a condition or disease associated with an RTK in a cellular 
organism, comprising: 

(a) administering a pharmaceutical composition as claimed in claim 38; and 

(b) activating or inhibiting the RTK to treat or prevent the disease. 

40. A method for treating or preventing a condition or disease involving increased RTK activity
comprising maintaining the RTK, or a binding pocket thereof involved in regulating the kinase
domain of the RTK, in an autoinhibited state.

41. A crystal comprising an RTK binding pocket, substantially as described herein and with reference to
the accompanying figures.